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1 Introduction 



In the last years many results in the area of semidefinite programming were 
obtained for invariant semidefinite programs — semidefinite programs which 

r»o , have symmetries. This was done for a variety of problems and applications. 

£> ' The purpose of this handbook chapter is to give the reader the necessary back- 

^~) ground for dealing with semidefinite programs which have symmetry. Here the 

focus is on the basic theory and on representative examples. We do not aim 
at completeness of the presentation. 

In all applications the underlying principles are similar: one simplifies the 
original semidefinite program which is invariant under a group action by ap- 

^^ ' plying an algebra isomorphism mapping a "large" matrix algebra to a "small" 

matrix algebra. Then it is sufficient to solve the semidefinite program using 
the smaller matrices. 

We start this chapter by developing the general framework in the intro- 

^^ ' duction where we give a step-by-step procedure for simplifying semidefinite 

programs: Especially Step 2 (first version), Step 2 (second version), and Step 
l| will be relevant in the later discussion. Both versions of Step 2 are based 
on the main structure theorem for matrix *-algebras. Step l-^ is based on the 
regular ^-representation. 

In Section 2 we give a proof of the main structure theorem for matrix 
*-algebras and we present the regular ^-representation. Strictly speaking the 
framework of matrix ^-algebras is slightly too general for the applications we 
have in mind. However, working with matrix *-algebras does not cause much 
extra work and it also gives a numerical algorithm for finding an explicit 
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algebra isomorphism. Section 2 is mainly concerned with finite dimensional 
invariant scmidefinite programs. In Section 3 we show how one can extend 
this to special classes of infinite dimensional invariant scmidefinite programs, 
namely those which arise from permutation actions of compact groups. We 
focus on this case because of space limitations and because it suffices for 
our examples. This section is connected to Step 2 (second version) of the 
introduction. 

The later sections contain examples coming from different areas: In Sec- 
tion 4 we consider finding upper bounds for finite error-correcting codes and 
in Section 5 we give lower bounds for the crossing number of complete bi- 
partite graphs. Both applications are based on the methods explained in Sec- 
tion 2 (Step 2 (first version) and Step l| in the introduction). In Section 6 
we use Step 2 (second version) for finding upper bounds for spherical codes 
and other geometric packing problems on the sphere. For this application the 
background in Section 3 is relevant. Section 4 and Section 6 both use the theta 
number of Lovasz for finding upper bounds for the independence number of 
highly symmetric graphs. In Section 7 we show how one can exploit symmetry 
in polynomial optimization: We give particular sum of squares representations 
of polynomials which have symmetry. 

This list of applications is not complete, and many more applications can 
be found in the literature. In the last Section 8 we give literature pointers to 
more applications. 

1.1 Complex semidefinite programs 

In order to present the theory as simple as possible we work with complex 
scmidefinite programs. We give the necessary definitions. A complex matrix 
X € <C nxn is Hermitian if X = X* , where X* is the conjugate transpose of 
X , i.e. Xij = Xji. It is called positive semidefinite, we write X >z 0, if for all 
(column) vectors (ai, . . . , a n ) G C" we have 

n n 
t=l j=l 

The space of complex matrices is equipped with a complex inner product, the 
trace product (X,Y) = trace(y*X), which is linear in the first entry. The 
inner product of two Hermitian matrices is always real. 

Definition 1.1 A (complex) semidefinite program is an optimization prob- 
lem of the form 

max{(X, C) : X > 0, {X, A x ) = bi, . . . , (X, A m ) = b m }, (1) 

where A\,...,A m G C nx ™, and C G C nxn are given Hermitian matrices, 
(bi,... ,b m ) G W n is a given vector and X G C nxrl is a variable Hermitian 
matrix. 
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A Hermitian matrix A <G C" x ™ is called a feasible solution of (1) if it 
is positive semidefinite and fulfills all m linear constraints. It is called an 
optimal solution if it is feasible and if for every feasible solutions Y we have 
(X,C)>(Y,C). 

There is an easy reduction from complex semidefinite programs to semidef- 
inite programs involving real matrices only, as noticed by Goemans and 
Williamson [43]. A complex matrix X e c nx ™ defines a real matrix 

/»(*) -$t(X)\ 2nx2n 
\Q(X) fR(X) J 

where Jft(A) <G R nxn and 3(A) <E R nxn are the real and imaginary parts of A. 
Then the properties of being Hermitian and being complex positive semidef- 
inite translate into being symmetric and being real positive semidefinite: We 
have for all real vectors a — (a\, . . . , (X2n) € K 2n : 

\$s(X) K(A) ) ~ 

On the other hand, complex semidefinite programs fit into the framework 
of conic programming (see e.g. Nemirovski [76]). Here one uses the cone of 
positive semidefinite Hermitian matrices instead of the cone of real positive 
scmidefinite matrices. There are implementations available, SeDuMi (Sturm 
[89]) for instance, which can deal with complex semidefinite programs directly. 

1.2 Semidefinite programs invariant under a group action 

Now we present the basic framework for simplifying a complex scmidefinite 
program which has symmetry, i.e. which is invariant under the action of a 
group. 

Let us fix some notation first. Let G be a finite group. Let it : G — > U n (C) 
be a unitary representation of G, that is, a group homomorphism from the 
group G to the group of unitary matrices U n (C). The group G is acting on 
the set of Hermitian matrices by 

{g,A)^n(g)ATr(g)*. 

In general, a (left) action of a group G on a set M is a map 

G x M -)• M, (g, x) H- gx, 

that satisfies the following properties: We have lx = x for all x € M where 
1 denotes the neutral element of G. Furthermore, (gig2){x) = gi(g 2 x) for all 
<7i, gi € G and all x <G M. 

A matrix A is called G-invariant if A = gX for all g € G, and we denote 
the set of all G-invariant matrices by (C™ xn ) G . We say that the semidefinite 
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program (1) is G-invariant if for every feasible solution X and for every g G G 
the matrix gX is again a feasible solution and if it satisfies (gX, C) = {X, C) 
for all g € G. 

One example, which will receive special attention because of its impor- 
tance, is the case of a permutation action: The set of feasible solutions is 
invariant under simultaneous permutations of rows and columns. Let G be a 
finite group which acts on the index set [n] — {1, . . . ,n}. So we can see G 
as a subgroup of the permutation group on [n] . To a permutation a on [n] 
corresponds a matrix permutation P a e U n (C) defined by 



Kcrjtj 



1, iii = a(j), 

0, otherwise. 



and the underlying unitary representation is 7r(cr) = P a . In this case, the 
action on matrices X <G C™ x " is 

a{X) = P a XP*, where a(X)ij = X a -i {ij) = X a -i {i) ^-i (j) . 

In the following, we give some background on unitary representations. 
This part may be skipped at first reading. Let G be a finite group. 
A representation of G is a finite dimensional complex vector space V 
together with a homomorphism n : G — > G\(V) from G to the group 
of invertible linear maps on V . The space V is also called a G-space 
or a G-module and tt may be dropped in notations, replacing n(g)v 
by gv. In other words, the group G acts on V and this action has the 
additional property that g(Xv + g,w) = Xgv + figw for all (A, g) G C 2 
and (v,w) G V 2 . The dimension of a representation is the dimension of 
the underlying vector space V . 

If V is endowed with an inner product (v, w) which is invariant under G, 
i.e. satisfies (gv,gw) — (v,w) for all (v,w) £ V 2 and g € G, V is called a 
unitary representation of G. An inner product with this property always 
exists on V, since it is obtained by taking the average 

(v,w) = T^r^{gv,gw) 
11 sec 

of an arbitrary inner product (v,w)o on V. So, with an appropriate 
choice of a basis of V, any representation of V is isomorphic to one of 
the form it : G — > U n (C), the form in which unitary representations of 
G where defined above. 

Step 1: Restriction to invariant subspace 

Because of the convexity of (1), one can find an optimal solution of (1) in 

the set of G-invariant matrices. In fact, if X is an optimal solution of (1), so 

is its group average t^t J^ gX. Hence, (1) is equivalent to 
gee' 

max{(X, C):XhO, (X, A x ) = bi, • ■ • , (X, A m ) =b m ,Xe (C nxn f}. (2) 
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The G-invariant matrices intersected with the Hcrmitian matrices form a 
vector space. Let B\, . . . , B^ be a basis of this space. Step 1 of simplifying a 
G-invariant semidcfinitc program is rewriting (2) in terms of this basis. 

Step 1: If the semidefinite program (1) is G-invariant, then it is equivalent 
to 

max < (X, C) : xi, . . . , xn G C, 

X = xiB 1 -\ \-x N B N hO, (3) 

(X,Ai) =bi, i = 1, . . .,m\. 

In the case of a permutation action there is a canonical basis of (C" X ™) G 
which one can determine by looking at the orbits of the group action on pairs. 
Then, performing Step 1 using this basis amounts to coupling the variable 
matrix entries of X. 

The orbit of the pair (i,j) € [n] x [n] under the group G is given by 

0(i,j) = {(a(i),a(j)):aGG}. 

The set [n] x [n] decomposes into the orbits R\, . . . , Rm under the action of G 
For every r € {1, . . . , M} we define the matrix G r <G {0, 1}™ X " by (C r )ij = 1 
if (i,j) G R r and (C r )ij = otherwise. Then Ci,...,Cm forms a basis of 
(C™ X ") G , the canonical basis. If (i,j) G R r we also write G^.,] instead of C r . 
Then, Cj iA =G M . 

Note here that C\, . . . , Cm is a basis of (C" X ™) G . In order to get a basis of 
the space of G-invariant Hermitian matrices we have to consider the orbits of 
unordered pairs: We get the basis by setting Buj\ — Cuj-i if (i,j) and (j,i) 
are in the same orbit and Bujx — Cua + Cun if they are in different orbits. 
The matrix entries of X in (3) are constant on the (unordered) orbits of pairs: 
Xij = X a (ij\ = X a (jj\ = Xji for all (i,j) G [n] x [n] and a € G 

Step 2: Reducing the matrix sizes by block diagonalization 

The G-invariant subspace (C" XTl ) G is closed under matrix multiplication. 
This can be seen as follows: Let X,Y e (C nxn ) G and let g e G, then 

g(XY) = n(g)XYTr( g y - (^(. 9 )X7r( 3 )*)(^( 5 )r7r( 3 )*) = (gX)(gY) = XY. 

Moreover, it is also closed under taking the conjugate transpose, since, for 
X G (C" X ") G , Tr(g)X*Tr(g)* = (Tr(g)Xw(g)*)* =X*. Thus (C" xrl ) G has the 
structure of a matrix *-algebra. (However, not all matrix *-algebras are coming 
from group actions.) 

In general, a matrix * -algebra is a set of complex matrices that is closed 
under addition, scalar multiplication, matrix multiplication, and taking the 
conjugate transpose. The main structure theorem of matrix *-algebras is the 
following: 
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Theorem 1.2 Let A C C" x ™ be a matrix *-algebra. There are numbers d, 
and m,\, . . . , m^ so that there is * -isomorphism between A and a direct sum 
of complete matrix algebras 

d 

<p : ^^0C mfcXm ". 
fc=i 

We will give a proof of this theorem in Section 2.5 which also will give 
an algorithm for determining ip. In the case A = (C" X ™) G , the numbers d, 
and mi, . . . , m^ have a representation theoretic interpretation: The numbers 
are determined by the unitary representation ir : G — > U n (C), where d is the 
number of pairwise non-isomorphic irreducible representations contained in ir 
and where m^ is the multiplicity of the fc-th isomorphism class of irreducible 
representations contained in w. 

In the following we present background in representation theory which 
is needed for the understanding of the numbers d and mi , . . . , nid in 
Theorem 1.2. Again, this part may be skipped at first reading. 

A G -homomorphism T between two G-spaces V and W is a linear map 
that commutes with the actions of G on V and W: for all g G G 
and all v £ V we have T(gv) — gT(v). If T is invertible, then it is 
a G '-isomorphism and V and W are G-isomorphic. The set of all G- 
homomorphisms is a linear space, denoted Hom G (V, W). If V = W, 
then T is said to be a G-endomorphism and the set End (V) of G- 
endomorphisms of V is moreover an algebra under composition. If 

V = C™, and if G acts on C n by unitary matrices, then we have 
End G (V) = (C" xn ) G = A. 

A representation n : G — > G\(V) of G (and the corresponding G- 
space V) is irreducible if it contains no proper subspace W such that 
gW C W for all g £ G, i.e. V contains no G-subspace. If V contains a 
proper G-subspace W, then also the orthogonal complement W ± rela- 
tive to a G-invariant inner product, is a G-subspace and V is the direct 
sum W ® W ± . Inductively, one obtains Maschke's theorem: 

Every G-space is the direct sum of irreducible G-subspaces. 

This decomposition is generally not unique: For example, if G acts triv- 
ially on a vector space V (i.e. gv — v for all g € G, v € V) of dimension 
at least 2, the irreducible subspaces are the 1-dimensional subspaces and 

V can be decomposed in many ways. 

From now on, we fix a set 1Z — {Rk : k — 1, . . . , d} of representatives 
of the isomorphism classes of irreducible G-subspaces which are direct 
summands of V. Starting from an arbitrary decomposition of V, we 
consider, for k = 1, . . . , d, the sum of the irreducible subspaces which 
are isomorphic to Rk- One can prove that this G-subspace of V, is 
independent of the decomposition. It is called the isotypic component 
of V associated to Rk and is denoted MTk- The integer m^ such that 
MTk — R™ 1 * is called the multiplicity of Rk in V . In other words, we 
have 
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d m k 

V = 0MI fc and MI t =04, 

fe=l 2=1 

where, Hk,i is isomorphic to Rk, and nik > 1. The first decomposition 

is orthogonal with respect to an invariant inner product and is uniquely 

determined by V, while the decomposition of MTk is not unique unless 

m k = 1. 

Schur's lemma is the next crucial ingredient for the description of 

End G (V): 

If V is G -irreducible, then End G (l/) = {Aid : A £ C} ~ C. 
In the general case, when V is not necessarily G-irreducible, 

d 

End G (T/)~0C mfcXmfc . 

fc=i 

This result will be derived in the next section, in an algorithmic way, as 
a consequence of the more general theory of matrix *-algebras. 

So we consider a *-isomorphism ip given by Theorem 1.2 applied to A = 

(ipnX7i\G . 

d 
j, : (C xn ) G 4 0C m * Xmt . (4) 

fc=l 

Notice that since ip is a ^-isomorphism between matrix algebras with unity, 
ip preserves also eigenvalues and hence positive semidefinitcness. Indeed, let 
X e (C™ xrl ) G be G-invariant, then also X - XI is G-invariant, and X - XI 
has an inverse if and only if <p(X) — XI has a inverse. This means that a test 
whether a (large) G-invariant matrix is positive semidefinite can be reduced 
to a test whether d (small) matrices are positive semidefinite. Hence, applying 
ip to (3) gives the second and final step of simplifying (1): 

Step 2 (first version): If the semidefinite program (1) is G-invariant, 
then it is equivalent to 

max < (X, C) : x\ , . . . , xm € C, 

X = xiBi + • • • + x N B N y 0, ,_. 

(X,Ai) =bi, i = l,...,m, " 



X!<p(Bi) + ■■■+ x N (p(B N ) y o}. 



Applying ip to a G-invariant semidefinite program is also called block di- 
agonalization. The advantage of (5) is that instead of dealing with matrices 
of size n x n one only has to deal with block diagonal matrices with d block 
matrices of size mi, . . . , rrid, respectively. So one reduces the dimension from 
n 2 to m\ + • • • + m 2 d . In the case of a permutation action this sum is also the 
number of distinct orbits M. In many applications the latter is much smaller 
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than the former. In particular many practical solvers take advantage of the 
block structure to speed up the numerical calculations. 

Instead of working with the *-isomorphism ip and a basis Bi , . . . , Bjq of the 
Hcrmitian G-invariant matrices, one can also work with the inverse (p~ x and 
the standard basis of ® fe=1 C" lfcXmfc . This is given by the matrices E k ^ uv € 
C m ' xm ' where all entries of Ek,uv are zero except the (u, w)-entry which 
equals 1. This gives an explicit parametrization of the cone of G-invariant 
positive semidefinite matrices. 

Step 2 (second version) : If the semidefinite program (1) is G-invariant, 
then it is equivalent to 



\{X,C): 



d m k 
k=l u.v=l 



( x >°<™)i<u,v<m h ^ > * = i> ■•■.<*> 

(X,Ai) =b t , i = 1, . . . ,m\. 
Hence, every G-invariant positive semidefinite matrix X is of the form 



(6) 



d m k 



k—l u.v — 1 

where the d matrices 

X k = Ock,m>)i< u ,,,<m*. ' k = l,...,d, 

are positive semidefinite. Define for (i,j) € [n] x [n] the matrix Ek(i,j) G 
C m ' xm ' componentwise by 

[Ek(i,j)]uv = [f^ 1 {E k ^uv)] lj ■ 

By definition we have E k {i,j)* = Ek{j, i). Then, in the case of a permutation 
action, Ek(i,j) — Eh(a(i),a(j)) for all (i,j) € [n] x [n] and a e G. With this 
notation one can write the (i, j)-entry of X by 

d 
X ij =J2(Xk,E k (i,j)). (7) 

fe=i 

In summary, finding a block diagonalization of a G-invariant semidefinite 
program amounts to first identifying a basis of the Hcrmitian G-invariant 
matrices and then in finding an explicit *-isomorphism (4) between the algebra 
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of G-invariant matrices and the direct sum of complete matrix algebras. In 
the following sections we will mainly be concerned with different strategies to 
find such a ^-isomorphism. 

Step 1-|: Reducing the matrix sizes by the regular ^representation 



In general finding a block diagonalization of a G-invariant semidefinite 
program is a non-trivial task, especially because one has to construct an ex- 
plicit *-isomorphism. In cases where one does not have this one can fall back 
to a simpler ^-isomorphism coming from the regular ^-representation. In gen- 
eral this does not provide the maximum possible simplification. However, for 
instance in the case of a permutation action, it has the advantage that one 
can compute it on the level of knowing the orbit structure of the group action 
only. 

For this we consider an orthogonal basis (with respect to the trace inner 
product (•,•)) of the G-invariant algebra (C" xn ) G . For instance, we can use the 
canonical basis C\ , . . . , Cm in the case of a permutation action. By considering 
the multiplication table of the algebra we define the multiplication parameters 
pl sl sometimes also called structural parameters, by 

M 



C r C s =J2Prs C t 



4 = 1 

In the case of a permutation action the structural parameters can be computed 
by knowing the structure of orbits (if one chose the canonical basis): 

pl s = \{k G [n] : (*, k) G R r , {k,j) G R.}\, 

where (i,j) G Rt- Here, p\. s does not depend on the choice of i and j. The 
norms ||G r || = y/{C r ,C r ) equal the sizes of the corresponding orbits. We 
define the matrices L(C r )st G C MxM by 

_ (C r C t ,C.) _\\C.\\ 

Theorem 1.3 Let C the algebra generated by the matrices L{C\), . . . , L(Cm)- 
Then the linear map 

c/> : ( C " X ") G -). C, 4>{C r ) = L(C r ), r = l,...,M, 

is a * -isomorphism. 

We will give a proof of this theorem in Section 2.6. There we will show that 
the *-isomorphism is the regular *-rcprcscntation of the G-invariant algebra 
associated with the orthonormal basis Gi/||Gi||, . . . , Cm/\\Cm\\- Again, since 
4> is a *-isomorphism between algebras with unity it preserves eigenvalues. 
This means that a test whether a G-invariant matrix of size nxnis positive 
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scmidcfinite can be reduced to testing whether an M x M matrix is positive 
scmidefinitc. 

Step 1^ : If the semidefinite program (1) is G-invariant, then it is equiv- 
alent to (5) where the * -isomorphism <p is replaced by <j). 



2 Matrix *-algebras 

In Section 1.2 we saw that the process of block diagonalizing a semidefinite 
program can be naturally done in the framework of matrix *-algebras using 
the main structure theorem (Theorem 1.2). In this section we prove this main 
structure theorem. Although we are mainly interested in the case when the 
matrix *-algebra comes from a group, working in the more general framework 
here, does not cause much extra work. Furthermore the proof of the main 
structure theorem we give here provides an algorithmic way for finding a 
block diagonalization. 

We start by giving the basic definitions, examples, and results of matrix *- 
algebras (Section 2.1-Section 2.4). In Section 2.5 we prove the main structure 
theorem which gives a very efficient representation of a matrix *-algcbra A: 
We show that A is *-isomorphic to a direct sum of full matrix *-algebras. 
The corresponding ^-isomorphism is called a block diagonalization of A. This 
corresponds to Step 2 in the introduction. After giving the proof we interpret 
it in the context of groups and we discuss a numerical algorithm for finding a 
block diagonalization which is based on the proof. In Section 2.6 we consider 
the regular ^representation, which embeds A into C MxM , where M = dim„4. 
This corresponds to Step 1^ in the introduction. 

2.1 Definitions and examples 

Definition 2.1 A matrix *-algebra is a linear subspace A C C" x ™ of complex 
n x n matrices, that is closed under matrix multiplication and under taking 
the conjugate transpose. The conjugate transpose of a matrix A is denoted A* . 

Matrix *-algebras are finite dimensional C*-algebras and many results 
here can be extended to a more general setting. For a gentle introduction to 
C*-algebras we refer to Takesaki [92]. 

Trivial examples of matrix *-algebras are the full matrix algebra C" x ™ and 
the zero algebra {0}. From given matrix *-algebras A C C nx ™ and B C C mxm , 
we can construct the direct sum A © B and tensor product A® B defined by 

A®B = {(*°):AeA,BeB}, 
A B = i Y^ A i ® B i : k e N > A i e A Bi € B I , 
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where A B e ^nmxnm denotes the Kronecker- or tensor product. The corn- 
mutant of A is the matrix *-algebra 

A' = {B e C nxn :BA = AB for all A e A}. 

Many interesting examples of matrix *-algcbras come from unitary group 
representations, as we already demonstrated in the introduction: Given a 
unitary representation 7r : G — > Gl(n, C), the set of invariant matrices 
(C nxn ) G = {A e C nxn : n(g)An{g)- 1 = A} is a matrix *-algebra. It is 
the commutant of the matrix *-algebra linearly spanned by the matrices ir(g) 
with g £ G. If the unitary group representation is given by permutation matri- 
ces then the canonical basis of the algebra (£, nxn ) G are the zero-one incidence 
matrices of orbits on pairs Ci, . . . , Cm, see Step 1 in Section 1.2. 

Other examples of matrix *-algcbras, potentially not coming from groups, 
include the (complex) Bose-Mesner algebra of an association scheme, see e.g. 
Bannai, Ito [13], and Brouwer, Cohen, Neumaier [20] and more generally, the 
adjacency algebra of a coherent configuration, see e.g. Cameron [22]. 

2.2 Commutative matrix *-algebras 

A matrix *-algebra A is called commutative (or Abelian) if any pair of its 
elements commute: AB = BA for all A, B <G A. Recall that a matrix A 
is normal if AA* = A* A. The spectral theorem for normal matrices states 
that if A is normal, there exists a unitary matrix U such that U*AU is a 
diagonal matrix. More generally, a set of commuting normal matrices can be 
simultaneously diagonalized (see e.g. Horn, Johnson [48]). Since any algebra 
of diagonal matrices has a basis of zero-one diagonal matrices with disjoint 
support, we have the following theorem. 

Theorem 2.2 Let A C C™ x " be a commutative matrix *-algebra. Then there 
exist a unitary matrix U and a partition [n] = Sq U Si U • • • U Sk with S\, . . . ,Sk 
nonempty, such that 

U*AU = {AiJi + • • • + Afe/fc : Ai, . . . , A fc e C}, 

where Ii is the zero- one diagonal matrix with ones in positions from Si and 
zeroes elsewhere. 

The matrices E { = UliU* satisfy E + E x -\ \- E k = I, E { E d = S^Ei 

and Ei = E* . The matrices E\ , . . . , Ek are the minimal idempotents of A and 
form an orthogonal basis of A. Unless So — 0, Eg does not belong to A. 

Geometrically, we have an orthogonal decomposition 

c n = V a ®V 1 ®---®V k , 

where Ei is the orthogonal projection onto Vi or equivalently, Vi is the space 
spanned by the columns of U corresponding to Si. The space Vq is the maximal 
subspace contained in the kernel of all matrices in A. 
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2.3 Positive semidefinite elements 

Recall that a matrix A is positive semidefinite (^4 >z 0) if and only if A 
is Hermitian (that is A* = A) and all eigenvalues of A are nonncgative. 
Equivalently, A = U*DU for some unitary matrix U and nonncgative diagonal 
matrix D. 

Let A be a matrix *-algebra. Positive scmidefinitcness can be characterized 
in terms of A. 

Proposition 2.3 An element A G A is positive semidefinite if and only if 
A = B*B for some B G A. 

Proof. The 'if part is trivial. To see the 'only if part, let A E A be positive 
semidefinite and write A = U*DU for some unitary matrix U and (nonncga- 
tive real) diagonal matrix D. Let p be a polynomial with p(\i) — \f\i for all 
eigenvalues \ of A. Then taking B = p(A) E A we have B = U*p(D)U and 
hence B*B = U*p{D)p{D)U = U*DU = A. D 

Considered as a cone in the space of Hermitian matrices in A, the cone of 
positive semidefinite matrices is self-dual: 

Theorem 2.4 Let A E A be Hermitian. Then A y if and only if (A, B) > 
for all B h in A. 

Proof. Necessity is clear. For sufficiency, let A E A be Hermitian and let 
B C A be the *-subalgebra generated by A. By Theorem 2.2 we can write 
A = Ai-Bi + • • • + XkEk, where the Ei are the minimal idempotents of B. The 
Ei are positive semidefinite since their eigenvalues are zero or one. Hence by 
assumption, A« = / ' '\ > 0. Therefore A is a nonnegative linear combination 
of positive semidefinite matrices and hence positive semidefinite. □ 

As a corollary we have: 

Corollary 2.5 The orthogonal projection ixj^ : C" x ™ — > A preserves positive 
semidefiniteness. 

This implies that if the input matrices of the semidefinite program (1) lie in 
some matrix *-algebra, then we can assume that the optimization variable X 
lies in the same matrix *-algebra: If (1) is given by matrices C,A\,..., A m E A 
for some matrix *-algebra A, the variable X may be restricted to A with- 
out changing the objective value. Indeed, any feasible X can be replaced by 
tt_a(X), which is again feasible and has the same objective value. When A is 
the invariant algebra of a group, this amounts to replacing X by the average 
under the action of the group. 
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2.4 *-Homomorphisms and block diagonalization 

Definition 2.6 A map <fi : A —¥ B between two matrix ^-algebras A and B is 
called a *-homomorphism if 

(i) 4> is linear, 
(ii) 4>{AB) = cj){A)(j}{B) for all A, B e A, 

(Hi) (f)(A*) = 4>{A)* for all A e A. 

If 4> is a bisection, the inverse map is also a * -homomorphism and (f> is called 
a *-isomorphism. 

It follows directly from Proposition 2.3 that *-liomomorphisms preserve 
positive semidefiniteness: Let <f> : A —¥ B be a ^-homomorphism. Then <fi(A) 
is positive semidefinite for every positive semidefinite A € A. 

The implication of this for semidefinite programming is the following. 
Given a semidefinite program with matrix variable restricted to a matrix 
*-algebra A and a *-isomorphism A — > B, we can rewrite the semidefinite 
program in terms of matrices in B. This can be very useful if the matrices in 
B have small size compared to those in A. In the following, we will discuss 
two such efficient representations of a general matrix *-algebra. 

2.5 Block diagonalization 

In this section, we study the structure of matrix ^-algebras in some more 
detail. The main result is, that any matrix *-algebra is ^-isomorphic to a 
direct sum of full matrix algebras: 

Theorem 2.7 (— Theorem 1.2) Let A C <C nxn be a matrix *-algebra. 
There are numbers d, and mi, . . . , m^ so that there is ^-isomorphism between 
A and a direct sum of complete matrix algebras 

d 

fc=i 

This theorem is well-known in the theory of C*-algebras, where it is gen- 
eralized to C*-algebras of compact operators on a Hilbert space (cf. Davidson 
[28, Chapter 1.10]). 

Some terminology: Let A be a matrix *-algebra. A matrix *-algebra B 
contained in A is called a *-subalgebra of A. An important example is the 
center of A defined by 

C A = {A e A : AB = BA for all B e A}. 

There is a unique element E <E A such that EA = AE = A for every A € A, 
which is called the unit of A. If A is non-zero and C_a = CE (equivalently: A 
has no nontrivial ideal), the matrix *-algebra A is called simple. 
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We shall prove that every matrix *-algebra is the direct sum of simple 

matrix *-algebras: 

d 

A = EiA, 

where the Ei are the minimal idempotents of Cj±. Every simple matrix *- 
algebra is ^-isomorphic to a full matrix algebra. Together these facts imply 
Theorem 2.7. 

To conclude this section, we will give an elementary and detailed proof of 
Theorem 2.7. 

Proof. Let B C A be an inclusionwise maximal commutative *-subalgebra 
of A. Then any A £ A that commutes with every element of B, is itself 
an element of B. Indeed, if A is normal, this follows from the maximality 
of B since B U {A, A*} generates a commutative *-algebra containing B. If 
A is not normal, then A + A* £ B by the previous argument and hence 
A(A + A*) = (A + A*) A, contradicting the fact that A is not normal. 

By replacing A by U*AU for a suitable unitary matrix U , we may assume 
that B is in diagonal form as in Theorem 2.2. For any A £ A and i, j = 0, . . . , k, 
denote by A^ £ C' *'*' j ' the restriction of liAIj to the rows in Si and 
columns in Sj. Let A £ A and i,j £ {0, ...,fc}. We make the following 
observations: 

(i) An is a multiple of the identity matrix and Aqq = 0, 
(ii) Aoi and Aio are zero matrices, 
(iii) Aij is either zero or a nonzero multiple of a (square) unitary matrix. 

Item (i) follows since UAIi commutes with Jo, . . . ,Ik and therefore belongs 
to B. Hence IiAIi is a multiple of Ii and IqAIq — since IqBIq = {0}. 
Similarly, I AA*I = 0, which implies that IqA = 0, showing (ii). For item 
(iii), suppose that A^ is nonzero and assume without loss of generality that 
\Si\ > \Sj\. Then by (i), AijA*j — XI for some positive real A, and therefore 
has rank \Si\. This implies that \Sj\ ~ \Si\ and vA • Aij is unitary. 

Observe that (ii) shows that I\ + ■ ■ ■ + 1^ is the unit of A. 

Define the relation ~ on {l,...,fc} by setting i ~ j if and only if 
liAIj y^ {0}. This is an equivalence relation. Indeed, ~ is reflexive by (i) 
and symmetric since liAIj = (IjAIi)*. Transitivity follows from (iii) since 
IhAIj 2 (IhAIi)(IiAIj) and the product of two unitary matrices is unitary. 

Denote by {Ei 7 . . . , E d } = {J2j~i Ij '■ i = 1, ■ ■ ■ ,k} the zero-one diagonal 
matrices induced by the equivalence relation. Since the center C^ of A is 
contained in B = C/i + • • • + Clk , it follows by construction that E\ , . . . , E4 
span the center, and are its minimal idempotents. We find the following block 
structure of A: 

A = {0} e e x a e • • • e e a a, 

where the matrix *-algebras EiA are simple. For the rest of the proof we may 
assume that A is simple (d = 1) and that E = 0. 
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Since ~ has only one equivalence class, for every matrix A = {A^y? ■ 1 e 
A, the 'blocks' A t j are square matrices of the same size. Furthermore, we can 
fix an A E A for which all the A^ are unitary. For any B e A, we have 
A u B l0 {A l0 y = (AliBIjA*)!!. By (i), it follows that {AuB^A^)* : B e 
B} = CI. Hence setting U to be the unitary matrix U :— diag(An, . . . , Aik), 
we see that UAU* = {(ay/)- =1 : dij G C}, which shows that A is *- 
isomorphic to C kxk . D 



Relation to group representations 

In the case that A = (C nx ™) G , where 7r : G — >• U n (C) is a unitary represen- 
tation, the block diagonalization can be interpreted as follows. The diagonal- 
ization of the maximal commutative *-subalgebra B, gives a decomposition 
C" = Vi © • • • © 14 into irreducible submodules. Observe that V = {0} since 
A contains the identity matrix. The equivalence relation <~ yields the isotypic 
components Imi^i , . . . , Imi^ , where the sizes of the equivalence classes corre- 
spond to the block sizes rrii in Theorem 2.7. Here Schur's lemma is reflected 
by the fact that IiAIj = Cij if i ~ j and {0} otherwise. 

Algorithmic aspects 

If a matrix *-algcbra A is given explicitly by a basis, then the above proof of 
Theorem 2.7 can be used to find a block diagonalization of A computationally. 
Indeed, it suffices to find an inclusion- wise maximal commutative *-subalgebra 
B C A and compute a common system of eigenvectors for (basis) elements 
of B. This can be done by standard linear algebra methods. For example 
finding a maximal commutative *-subalgcbra of A can be done by starting 
with B = (A) , the *-subalgebra generated by an arbitrary Hcrmitian clement 
in A. As long as B is not maximal, there is a Hcrmitian clement in A \ B 
that commutes with all elements in (a basis of) B, hence extending B. Such 
an element can be found by solving a linear system in O (dim .4) variables and 
0(dim£>) constraints. In at most dim .4 iterations, a maximal commutative 
*-subalgcbra is found. 

Practically more efficient is to find a "generic" Hermitian element A € A. 
Then the matrix *-algebra B generated by A will be a maximal commutative 
*-subalgebra of A and diagonalizing the matrix A also diagonalizes B. Such a 
generic element can be found by taking a random Hermitian element from A 
(with respect to the basis), see Murota et. al. [74]. If a basis for the center of 
A is known a priori (or by solving a linear system in O (dim .4) variables and 
equations), as an intermediate step the center could be diagonalized, followed 
by a block diagonalization of the simple components of A, see Dobre, de Klerk, 
Pasechnik [34]. 



16 C. Bachoc, D.C. Gijswijt, A. Schrijvcr, F. Vallentin 

2.6 Regular ^-representation 

Let A be a matrix *-algebra of dimension M and let C\ , . . . , Cm be an or- 
thonormal basis of A with respect to the trace product (•, •). For fixed A e A, 
left-multiplication by A defines a linear map B i->- AB on A. With respect 
to the ortho normal basis C\, . . . , Cm, this linear map is represented by the 
matrix L(A) e C MxM given by 

L(A) st = (AC t ,C s ). 

The map 



L:A 



•<MxM 



is an injective *-homomorphism called the regular * -representation of A. The 
fact that L is linear and preserves matrix products is clear. Injectivity follows 
from the fact that L(A) = implies AA* = and hence A — 0. Finally, the 
equations 



L(A*) st = (A*C t ,C s ) = (C t ,AC s ) = (AC s ,C t ) = L(A) ts 

show that L(A*) = L(j4)*. Because L is linear, it is determined by the images 

L(Ci),...,L(CW). 

In many applications, for example in the case of a permutation action with 
the canonical basis C\, . . . , Cm (Step 1^ in the introduction), one only has an 
orthogonal basis which is not orthonormal. In that case, the map L is given 
by 

T (n \ _ (C r C t ,C s ) _ \\Cs\\ s 

J^{^r)st — ,,„ I, ,,„ I, — ,,„ ||P rt , 

where the p* s are the multiplication parameters defined by C r C s — ^2 t Pr S Ct. 
If we denote </>(C r ) = L(C r ), we obtain Theorem 1.3. 



3 Invariant positive definite functions on compact spaces 

Until now we considered only finite dimensional invariant scmidefinite pro- 
grams and the question: How can symmetry coming from the action of a 
finite group be exploited to simplify the semidefinite program? In several sit- 
uations, some given in Section 6, one wants to work with infinite dimensional 
invariant semidefinite programs. In these situations using the symmetry com- 
ing from the action of an infinite, continuous group is a must if one wants to 
do explicit computations. In this section we introduce the cone of continuous, 
positive definite functions on a compact set M, as a natural generalization 
of the cone of positive semidefinite matrices. When a compact group G acts 
continuously on M, we use the representation theory of G to describe the 
subcone of G-invariant positive definite functions on M . This is the infinite 



Invariant scmidefinite programs 17 

dimensional analog of Step 2 (second version), given in the introduction, in 
the case of a permutation representation. 

In general, this method evidences interesting links between the geometry 
of M, the representations of G and the theory of orthogonal polynomials. 
We will review the spaces M, finite and infinite, where this method has been 
completely worked out. 

3.1 Positive definite functions 

Let M be a compact space. We denote the space of continuous functions on 
M taking complex values by C(M). 

Definition 3.1 We say that F € C(M 2 ) is positive definite, F >; 0, if 
F(x,y) = F(y,x) and, for all n, for all n-tuples (xi, . . . ,x n ) € M" and 
vectors (ai, . . . , a n ) G C™, 

n 

y^ aiF(xi,Xj)aJ > 0. 
In oi/ier words, for all choices of finite subsets {x\, . . . ,x n } of M , the matrix 

\ F \ Xi > X j))l<i,j<n 

is Hermitian and positive semidefinite. The set of continuous positive definite 
functions on M is denoted C(M 2 )yo- 

In particular, if M is a finite set with the discrete topology, the elements of 
C(M 2 )yo coincide with the Hermitian positive semidefinite matrices indexed 
by M. In general, C(M 2 )yo is a closed convex cone in C(M 2 ). 

We now assume that a compact group G acts continuously on M. Here we 
mean that, in addition to the properties of a group action (see introduction), 
the map G x M — > M, with (g, x) h^ gx, is continuous. We also assume that 
M is endowed with a regular Borel measure fj, which is G-invariant. 

We recall here the basic notions on measures that will be needed. If 
M is a topological space, a Borel measure is a measure defined on the 
cr-algebra generated by the open sets of M. A regular Borel measure is 
one which behaves well with respect to this topology, we refer to Rudin 
[83] for the precise definition. 

If M is a finite set, the topology on M is chosen to be the discrete 
topology, i.e. every subset of M is an open set, and the measure u on M 
is the counting measure defined by n{A) — \A\. 

On a compact group G, there is a regular Borel measure A which is 
left and right invariant, meaning that X(gA) = \(Ag) — \(A) for all 
g G G and A C G measurable, and such that the open sets have positive 
measure. This measure is unique up to a positive multiplicative constant 
and is called the Haar measure on G (see e.g. Conway [25, Theorem 
11.4]). If G is finite, the counting measure on G provides a measure 
with these properties. 
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3.2 Unitary representations of compact groups 

The definitions and properties of representations given in the introduction 
extend naturally to compact groups. We only have to modify the definition 
of a finite dimensional representation by asking that the homomorphism 7r : 
G — > G1(V) is continuous. 

In the proofs, integrating with the Haar measure A of G replaces averaging 
on finite groups. For example, in the introduction, we have seen that every 
representation V of a finite group G is unitary. The argument was, starting 
from an arbitrary inner product (u,v) on V, to average it over all elements 
of G, in order to transform it into a G-invariant inner product. In the case 
of a more general compact group, we average over G using the Haar measure 
(normalized so that A(G) = 1), thus taking: 



(u,v) := / (gu,gv)d\(g). 
Jg 

So Maschke's theorem holds: every finite dimensional G-space is the direct 
sum of irreducible G-subspaces. In contrast to the case of finite groups, the 
number of isomorphism classes of irreducible G-spaces is generally infinite. 
We fix a set 1Z = {Rk : k > 0} of representatives, assuming for simplicity that 
this set is countable (it is the case e.g. if G is a Lie group). Here i?o is the 
trivial representation, i.e. V — C with gz — z for all g € G, z G C. 

However, infinite dimensional representations arise naturally in the study 
of infinite compact groups. We shall be primarily concerned with one of them, 
namely the representation of G defined by the space C(M) with the action of 
G defined by ir(g){f){x) = f{g~ l x) (of course it has infinite dimension only 
if M itself is not a finite set). It is a unitary representation for the standard 
inner product on C(M): 



(fuh) = — r^y / fi{x)f 2 (x)dfi(x). 

Moreover C(M) is a dense subspace of the Hilbert space L 2 (M) of measur- 
able functions / on M such that |/| 2 is ^i-integrable (see e.g. Conway [25, 
Chapter 1] or Rudin [83, Chapter 4]). It turns out that the theory of finite 
dimensional representations of compact groups extends nicely to the unitary 
representations on Hilbert spaces: 

Theorem 3.2 Every unitary representation of a compact group G on a 
Hilbert space is the direct sum (in the sense of Hilbert spaces) of finite di- 
mensional irreducible G-subspaces. 

The above theorem is in fact a consequence of the celebrated Theorem of 
Peter and Weyl (see e.g. Bump [21]), which states that the matrix coefficients 
of finite dimensional irreducible representations of a compact group G span a 
subspace of C(G) which is dense for the topology of uniform convergence. 
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3.3 G-invariant positive definite functions on M 

Now we come to our main concern, which is to describe the elements of 
C(M 2 )y , i.e. the G-invariant positive definite functions, in terms of the struc- 
ture of the G-space M. In order to avoid technicalities with convergence issues, 
we shall systematically work in finite dimensional subspaces of C(M). So let 
Vbea G-subspace of C(M) of finite dimension. Let V^ denote the subspace 
of C(M 2 ) spanned by elements of the form fi(x)f2(y), where /i and fa belong 
to V. 

We take the following notations for an irreducible decomposition of V: 

m k 

V=0®%, (8) 

k£l v i=l 

where for 1 < i < mfc, the subspaces Hk,i are pairwise orthogonal and G- iso- 
morphic to Rk , and where ly is the finite set of indices fc > such that the 
multiplicities mk are not equal to 0. For all fc, i, we choose an orthonormal 
basis (efc j i, . . . , e^ j <j fc ) °f ^fc,*' where dk = dim(Rk), such that the complex 
numbers (gek,i,si e k,i,t) do not depend on i (such a basis exists precisely be- 
cause the G- isomorphism class of Hk.i does not depend on i). From this data, 
we introduce the mk x m,k matrices Ek{x,y) with coefficients Ek.ij{x,y): 

dk 

E k ,ij{x,y) = y^ j ek,i, s (x)ek,j, s {y)- (9) 

8=1 

Then V^ 2 ' obviously contains the elements ek.i,s(x)ekj, s (y), which moreover 
form an orthonormal basis of this space. 

Theorem 3.3 Let F G V^ 2 ' . Then F is a G-invariant positive definite func- 
tion if and only if there exist Hermitian positive semidefinite matrices i*fc such 
that _ 

F(x,y)= Y,(F k ,E k (x,y)). (10) 

fcG/v 

Before we give a sketch of the proof, several comments arc in order: 

1. If M = [n], then we recover the paramctrization of G-invariant positive 
scmidefinite matrices given in (7). 

2. The main point of the above theorem is to replace the property of a contin- 
uous function being positive definite with the property that the mk x mj 
matrices F k are positive scmidefinite. In the introduction, we have al- 
ready seen in the finite case M — [n] that it allows to reduce the size 
of invariant semidefinite programs (following Step 2 (second version) of 
the introduction). In the general case, this theorem allows to replace a 
conic linear program involving the cone C(M 2 )yo with standard semidef- 
inite programs (see Section 6 for examples where M is the unit sphere of 
Euclidean space). 



20 C. Bachoc, D.C. Gijswijt, A. Schrijvcr, F. Vallentin 

3. One might wonder to what extent the above matrices Ek{x, y) depend on 
the choices involved in their definition. Indeed, one can prove that a differ- 
ent choice of orthonormal basis in H^i does not affect E^{x, y). One could 
also start from another decomposition of the isotypic component MXk of 
V. The new decomposition does not need to be orthogonal; the matrix 
E k (x,y) would then change to AEk(x, y)A* for some A e Gl„ lfc (C). So, 
up to an action of Gl mfc (C), the matrix Ek{x,y) is canonically associated 
to Mlk- 

4. The statement implies that the matrices E^{x,y) themselves are G- 
invariant. In other words, they can be expressed in terms of the orbits 
0(x, y) of the action of G on M 2 in the form: 

E k (x,y)=Y k (0(x,y)) (11) 

for some matrices Y k . It is indeed the expression we are seeking for. 



Proof. F(x,y) is a linear combination of the ek,i,s{x)ei,j,t(y) since these ele- 
ments form an orthonormal basis of V^ 2 K In a first step, one shows that the 
G-invariance property F(gx,gy) = F(x,y) results in an expression for F of 
the form (10) for some matrices Fk — (fk,ij)', the proof involves the orthog- 
onality relations between matrix coefficients of irreducible representations of 
G (see e.g. Bump [21]). In a second step, F >z is shown to be equivalent to 
Fk t: for all k € Iy- Indeed, for a(x) — Y^i^i a i e k,i,s{x), we have 

V aif k ,ijoq= , , 2 / a{x)F(x,y)a(y)d/j,(x,y) > 0. 

□ 

Remark 3.4 1. A straightforward generalization of the main structure the- 
orem for matrix ^-algebras to the *-algebra End (V) shows that 

End G (T0 ~ (V (2) ) G ~ C mfcXmfe . (12) 

kei v 

An isomorphism from (V^ 2 ') G to the direct sum of matrix algebras is con- 
structed in the proof of Theorem 3.3, in an explicit way, from the decom- 
position (8). This isomorphism is given by the map sending (Fk)kei v € 
®kei v C mkXmk to F e (V^) G defined by: 

F(x,y)= ^2{F k ,E k (x,y)) 

kei v 

thus completing Step 2 (second version) of the introduction for compact 
groups. 
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2. Theorem 3.3 shows, jointly with Theorem 3.2, that any element of the cone 
C(M 2 ) G is a sum (in the sense of L 2 convergence) of the form (10) with 
possibly infinitely many terms. Moreover, under the additional assumption 
that G acts transitively on M , Bochner [17] proves that the convergence 
holds in the stronger sense of uniform convergence, i.e. for the supremum 
norm. 



3.4 Examples: the commutative case 

We mean here the case when End (C(M)) is a commutative algebra. From 
Theorem 3.3, this condition is equivalent to the commutativity of End (V) for 
all finite dimensional G-subspace of C(M). So, from the isomorphism (12), and 
since a matrix algebra C mxm is commutative if and only if m = or m = 1, 
the commutative case corresponds to non vanishing multiplicities equal to 1 in 
the decomposition oiC(M). The G-space C(M) is said to be multiplicity- free. 
Then the matrices F^ in (10) have only one coefficient f k and F G C(M 2 ) G _ 
is of the form 

F(x,y) = J2f k E k (x,y), f k >0. (13) 

We say that M is G- symmetric, if G acts transitively on M (i.e. there is only 
one orbit in M; equivalcntly M is said to be a homogeneous space), and if 
moreover, for all (x, y) £ M 2 , there exists g <G G such that gx = y and gy = x. 
In other words (y,x) belongs to the G-orbit 0{x,y) of the pair (x,y). This 
nonstandard terminology (sometimes it is also called a generously transitive 
group action) covers the case of compact symmetric Riemannian manifolds 
(see e.g. Berger [16]) as well as a large number of finite spaces. We provide 
many examples of such spaces below. The functions Y& (11) are often referred 
to as the spherical functions or zonal spherical functions of the homogeneous 
space M (see e.g. Vilcnkin, Klimyk [98]). 

Lemma 3.5 If M is G-symmetric then for all V C C{M), (V^) G is com- 
mutative. 

Proof. Since F <G (V^) G is G-invariant and M is G-symmetric, F(x,y) = 
F(y,x). The multiplication on the algebra (V^) G ~ End (M) is the convo- 
lution F * F' of functions, given by: 

(F*F')(x,y)= f F(x,z)F'(z,y)du(z). 

A straightforward computation shows that F * F' — F' * F. D 

An important subclass of symmetric spaces is provided by the two-point homo- 
geneous spaces. These spaces are metric spaces, with distance d{x, y) invariant 
by G, and moreover satisfy that two pairs of elements (x,y) and (x',y') be- 
long to the same G-orbit if and only if d{x, y) = d(x' , y'). They are obviously 
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G-symmetric since d(x,y) = d(y,x). In other words, the distance function 
parametrizes the orbits of G acting on pairs, thus the expression (13) of pos- 
itive definite functions specializes to 

F(x, y) = J2 fkYk(d(x, y)), h > 0, (14) 

k>0 

for some functions Yfe in one variable. The functions Yk arc explicitly known for 
many spaces, and are usually given by a certain family of orthogonal polynomi- 
als (see e.g. Szego [91], Andrews, Askey, Roy [1]). The orthogonality property 
arises because the associated irreducible subspaces are pairwise orthogonal. 

The complete list of real, compact, connected two-point homogeneous 
spaces was established by Wang [99]: the unit sphere of Euclidean space 
S*™ -1 C R", which is treated in Section 6, the projective spaces over the 
fields of real, complex and quaternion numbers (denoted P n ~ 1 (K), with re- 
spectively K = M, C, H), and the projective plane over the octonions P 2 (0), 
are the only spaces having these properties. In each of these cases the functions 
Yfc are given by Jacobi polynomials (see Table 1). These polynomials depend 
on two parameters (a, (3) and are orthogonal for the measure (1 — t) a (l+t)^dt 
on the interval [—1, 1]. We denote by PJf ' (t) the Jacobi polynomial of de- 
gree fc, normalized by the condition PJf (1) = 1. When a — j3 = A — 1/2, 
these polynomials are equal (up to a nonnegative multiplicative factor) to the 
Gegenbauer polynomials C£ (t) . 



Space Group a f3 



S 7 ^" 0„(R) (n-3)/2 (n-3)/2 
P"- 1 ^) 0„(M) (n-3)/2 -1/2 
P"^(C) t/„(C) n-2 

P"^(B) U„(U) 2n-3 1 

P 2 (Q) F 4 (R) 7 3 

Table 1. The real compact two-point homogeneous spaces, their groups and their 
spherical functions Yfe. 



The finite two-point homogeneous spaces are not completely classified, but 
several of them are relevant spaces for coding theory. The most prominent one 
is the Hamming space q™, associated to the Krawtchouk polynomials, which 
is discussed in Section 4. It is the space of n-tuples, over a finite set q of q 
elements. An element of the Hamming space q™ is usually called a word, and 
q is called the alphabet. The closely related Johnson spaces are the subsets of 
binary words of fixed length and weight. The Hamming and Johnson spaces 
are endowed with the Hamming distance, counting the number of coordinates 
where to words disagree. The Johnson spaces have g-analogues, the q- Johnson 
spaces, the subsets of linear subspaces of IF™ of fixed dimension. The distance 
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between two subspaces of the same dimension is measured by the difference 
between this dimension and the dimension of their intersection. Other spaces 
are related to finite geometries and to spaces of matrices. In the later case, 
the distance between two matrices will be the rank of the difference of the 
matrices. 

The spherical functions have been worked out, initially in the framework of 
association schemes with the work of Dclsarte [29] , and later in relation with 
harmonic analysis of finite classical groups with the work of Delsarte, Dunkl, 
Stanton and others. We refer to Conway, Sloane [26, Chapter 9] and to Stanton 
[88] for surveys on these topics. Table 2 displays the most important families 
of these spaces, together with their groups of isometries and the family of 
orthogonal polynomials to which their spherical functions belong; see Stanton 
[881 for additional information. 



Space Group Polynomial Reference 

Hamming space q n S q I S„ Krawtchouk [29] 

Johnson space S n Halm [29], [31] 

q- Johnson space Gl n (F g ) g-Hahn [31] 

Maximal totally isotropic subspaces of dimension k, for a nonsingular bilinear form: 

Symmetric S02fc+i(F g ) g-Krawtchouk [88] 

S0 2fc (F g ) g-Krawtchouk [88] 

SO- k+2 (W q ) g-Krawtchouk [88] 

Symplectic Sp 2fe (F g ) g-Krawtchouk [88] 

Hermitian SU2fc(F g 2) g-Krawtchouk [88] 

SU 2 fe+i(F 9 2) g-Krawtchouk [88] 

Spaces of matrices: 

¥ k q xn ¥ k q x ".(G\ k (¥ q ) x G1„(F,)) Affine g-Krawtchouk [32], [88] 

Skew n (F g ) skew-symmetric Skew n (F g ). Gl n (F g ) Affine g-Krawtchouk [30], [88] 

Herm n (F g 2) Hermitian Herm n (F g 2). Gl n (F g 2) Affine g-Krawtchouk [88] 

Table 2. Some finite two-point homogeneous spaces, their groups and their spherical 
functions Y k . 



Symmetric spaces which are not two-point homogeneous give rise to func- 
tions Yfc depending on several variables. A typical example is provided by 
the real Grassmann spaces, the spaces of m-dimensional subspaces of R™, 
m < n/2, under the action of the orthogonal group 0(M. n ). Then, the func- 
tions Yfe are multivariate Jacobi polynomials, with m variables representing 
the principal angles between two subspaces (see James, Constantine [49]). 
A similar situation occurs in coding theory when nonhomogeneous alphabets 
are considered, involving multivariate Krawtchouk polynomials. Table 3 shows 
some examples of these spaces with references to coding theory applications. 



Nonbinary Johnson 


Oq — 1 ( On 


Permutation group 


>J n 


Lee space 


D q \S n 


Ordered Hamming space 


(F k q .B k )lS, 


Real Grassmann space 


On(K) 


Complex Grassmann space 


■ U n (C) 


Unitary group 


U n (Cf 
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Space Group Polynomial Reference 

Eberlein/Krawtchouk [93] 

Characters of S n [94] 

Multivariate Krawtchouk [3] 
Multivariate Krawtchouck [71], [14] 
Multivariate Jacobi [49], [4] 

Multivariate Jacobi [49], [81], [82] 

Schur [27] 

Table 3. Some symmetric spaces, their groups and their spherical functions Yfe. 



3.5 Examples: the noncommutative case 

Only a few cases have been completely worked out. Among them, the Eu- 
clidean sphere S*™ -1 under the action of the subgroup of the orthogonal group 
fixing one point is treated in Bachoc, Vallentin [6]. We come back to this case 
in Section 6. These results have been extended to the case of fixing many 
points in Musin [75] . The case of the binary Hamming space under the action 
of the symmetric group is treated in the framework of group representations 
in Vallentin [96] , shedding a new light on the results of Schrijver [86] . A very 
similar case is given by the set of linear subspaces of the finite vector space 
F™, treated as a q-analoguc of the binary Hamming space in Bachoc, Vallentin 
[8]. 



4 Block codes 

In this section, we give an overview of symmetry reduction in semidefinite 
programming bounds for (error correcting) codes. We fix an alphabet q = 
{0, 1, . . . , q — 1} for some integer q > 2. The Hamming space q" is equipped 
with a metric d(-, •) called the Hamming distance which is given by 

d(u,v) — \{i : Ui 7^ Vi]\ for all «,ueq". 

The isometry group of the Hamming space will be denoted Aut(g, n) and 
is a wreath product S q I S n of symmetric groups. That is, the isometries are 
obtained by taking a permutation of the n positions followed by independently 
permuting the symbols 0, . . . , q — 1 at each of the n positions. The group acts 
transitively on q™ and the orbits of pairs are given by the Hamming distance. 
A subset C C q™ is called a code of length n. For a (nonempty) code C, 
min{d(u, v) : u, v € C distinct} is the minimum distance of C. An important 
quantity in coding theory is the maximum size A q (n, d) of a code of length n 
and minimum distance at least d: 

A q {n, d) — max{|C| :CCq™ has minimum distance at least d}. 
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These codes are often called block codes since they can be used to encode 
messages into fixed-length blocks of words from a code C. The most studied 
case is that of binary codes, that is q = 2 and q = {0, 1}. In this case q is 
suppressed from the notation. An excellent reference work on coding theory 
is Mac Williams, Sloane [70]. 

Lower bounds for A q (n, d) are mostly obtained using explicit construc- 
tion of codes. Our focus here is on upper bounds obtained using semidcfinitc 
programming. 

4.1 Lovasz' i? and Delsarte's linear programming bound 

Let r q (n,d) be the graph on vertex set q™, connecting two words u, v £ q n 
if d(u, v) < d. Then the codes of minimum distance at most d correspond 
exactly to the independent sets in r q (n, d) and hence 

A q (n,d) = a(r q (n,d)), 

where a denotes the independence number of a graph. The graph parameter 
•&' , defined in Schrijvcr [86], is a slight modification of the Lovasz theta num- 
ber discussed in more detail in Section 6. It can be defined by the following 
semidcfinitc program 

tf'(T) = max{(X, J) : (X, I) = 1, X uv = if uv e E(r),X >0,Xh 0}, 

and it is a by-now classical lemma of Lovasz [69] that for any graph r the 
inequality a(T) < i?'(r) < i9(T) holds. 

Hence i9'(.T g (n, d)) is an (exponential size) semidcfinitc programming up- 
per bound for A q (n, d). Using symmetry reduction, this program can be solved 
efficiently. For this, observe that the SDP is invariant under the action of 
Aut(q,n). Hence we may restrict X to belong to the matrix *-algebra A of 
invariant matrices (the Bose-Mesner algebra of the Hamming scheme). The 
zero-one matrices Aq, . . . , A n corresponding to the orbits of pairs: 

(l iid(u,v)=i, 
[0 otherwise, 

form the canonical basis of A. Writing X = -^(xoAq + • • • +x n A n ), we obtain 
an SDP in n + 1 variables: 

n 

■d'(r q (n,d)) = max j J^ re; (") : x Q = l,x 1 = ••• = x d _ 1 =0, 

j=0 

n 

x d ,- ■ -,x n > 0, y^XjAj y oj. 

j=0 
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The second step is to block diagonalize the algebra A. In this case A is commu- 
tative, which means that the Ai can be simultaneously diagonalized, reducing 
the positive semidefinite constraint to the linear constraints 

n 

Y x i P iU) > for j = 0, . . . , n, 

»=o 

in the eigenvalues P{(j) of the matrices Ai. The Pj are the Krawtchouk poly- 
nomials and are given by 



P(ao = £(-iro(r fe x )(9-ir 



Thus the semidefinite program is reduced to a linear program, which is pre- 
cisely the Delsarte bound [29] . This relation between ■&' and the Delsarte bound 
was recognized independently in McEliece, Rodcmich, Rumsey [72] and Schri- 
jver [85]. In the more general setting of (commutative) association schemes, 
Delsarte's linear programming bound is obtained from a semidefinite program 
by symmetry reduction and diagonalizing the Bosc-Mesner algebra. 

4.2 Stronger bounds through triples 

Delsarte's bound is based on the distance distribution of a code, that is, on 
the number of code word-pairs for each orbit of pairs. Using orbits of triples, 
the bound can be tightened as was shown in Schrijver [86]. We describe this 
method for the binary case (q = 2). 

Denote by Stab(0, Aut(2,n)) C Aut(2,n) the stabilizer of 0. That is, 
Stab(0, Aut(2,n)) consists of just the permutations of the n positions. 

For any u,v,w £ {0,1}™, denote by 0(u, v, w) the orbit of the triple 
(u, v, w) under the action of Aut(2, n). Observe that since Aut(2, n) acts tran- 
sitively on {0, 1}™, the orbits of triples are in bijection with the orbits of pairs 
under Stab(0, Aut(2, n)), since we may assume that u is mapped to 0. There 
arc ( ™T ) such orbits, indexed by integers < t < i,j < n. Indeed, the or- 
bit of (v,w) under Stab(0, Aut(2,n)) is determined by the sizes i, j and t of 
respectively the supports of v and w and their intersection. 

Let C C {0, 1}™ be a code. We denote by A* the number of triples 
(u,v,w) € C 3 in the orbit indexed by i,j and t. This is the analogue for 
triples of the distance distribution. 

Denote by Mc the zero-one {0, l}"x{0, 1}™ matrix defined by (Mc) u ,v = 1 
if and only if u, v £ C. So Mc is a rank 1 positive semidefinite matrix. Now 
consider the following two matrices 

M'= J2 M aC , M"= Y, M °c- 

<r6Aut(2,n),0eo-C a£Aut(2,n),0gcrC 
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Clearly, M' and M" are nonnegative and positive scmidefinite. Furthermore, 
by construction M' and M" are invariant under the action of the stabilizer 
Stab(0, Aut(2, n)). Equivalently, any entry M' uv (or M" v ) only depends on the 
orbit of (0,u,v) under the action of Aut(2,n). In fact, M' uv equals (up to a 
factor depending on the orbit) the number of triples (c,c\c") G C 3 in that 
orbit. 

The matrix *- algebra A n of complex {0,1}™ x {0,1}™ matrices invariant 
under Stab(0, Aut(2, n)) has a basis of zero-one matrices corresponding to the 
orbits of pairs under the action of Stab(0, Aut(2, n)): {A* : < t < i, j < n}. 
This algebra is called the Terwilliger algebra of the (binary) Hamming scheme. 
The facts about M' and M" above lead to a scmidefinite programming bound 
for A(n, d), in terms of variables x\ ■. Nonnegativity of M',M" translates 
into nonnegativity of the x\ -. Excluding positive distances smaller than d 
translates into setting x\ • = for orbits containing two words at distance 1 
through d — 1. Scmidcfinitcncss of M' and M" translate into 



i,j,t i,j,t 



X i,j^iJ — 0' £^\ X i+j-t,0 X i,j)AiJ ^1 0. 



There are some more constraints. In particular, for u,v,w € {0,1}™ the vari- 
ables of the orbits of (u 7 v 7 w) and (w,u 7 v) (or any other reordering) must 
be equal. This further reduces the number of variables. For more details, see 
Schrijver [86]. 

Having reduced the SDP-variables by using the problem symmetry, the 
second step is to replace the positive semidefinite constraints by equivalent 
conditions on smaller matrices using a block diagonalization of A n . The block 
diagonalization of A n is given explicitly in Schrijver [86] (see also Vallcntin 
[96] , Dunkl [36] ) . The number of blocks equals 1 + [^ J , with sizes n + 1 — 2fc for 
k = 0, . . . , [f J • Observe that the sum of the squares of these numbers indeed 
equals ("J 3 ), the dimension of A n . 

For the non- binary case, an SDP upper bound for A q (n 1 d) can be defined 
similarly. The main differences are, that the orbits or triples are now indexed 
by four parameters i,j,p,t satisfying 0<p<t<i,j,i + j — t < n. Hence 
the corresponding invariant algebra is a matrix *-algebra of dimension ( ^ J . 
In that case the block diagonalization is a bit more involved having blocks 
indexed by the integers a, k with 0<a<k<n + a— fc, and size n + a + 1 — 2k. 
See Gijswijt, Schrijver, Tanaka [42] for details. 

4.3 Hierarchies and fe-tuples 

Moment matrices 

Let V be a finite set and denote by V(V) its power set. For any y : ^(V) — )■ C, 
denote by M(y) the V(V) x V(V) matrix given by 

[M(y)] s ,T = Vsut- 
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Such a matrix is called a combinatorial moment matrix (see Laurent [66]). 

Let C C V and denote by x = x C <= {0, l} y the zero-one incidence vector 
of C. The vector x can be "lifted" to y e {0, 1} V ( V *> by setting y s = flues x « 
for all S 1 € V(V). Note that 3/0 = 1. Then the moment matrix M(y) is positive 
semidefinite since M(y) — yy T ■ This observation has an important converse 
(see Laurent [66]). 

Theorem 1. Let M(y) be a moment matrix with 2/0 = 1 and M{y) y 0. Then 
M(y) is a convex combination of moment matrices M (j/i), . . . , M{yk), where 
yi is obtained from lifting a vector in {0, 1} V . 

Let r = (V, E) be a graph on V. Theorem 1 shows that the independence 
number is given by 

a(r) = max < > yr v \ : y^ = 1, ys = if S contains an edge, 

vev ( 15 ) 

M(y) h 0}. 

By replacing M(y) by suitable principal submatrices, this (exponential size) 
scmidchnite program can be relaxed to obtain more tractable upper bounds 
on a(.T). For example, restricting rows and columns to sets of size at most 1 
(and restricting y to sets of size < 2), we obtain the Lovasz i9 number (adding 
nonncgativity of ys for S of size 2 gives •&'). 

To describe a large class of useful submatrices, fix a nonnegative integer 
s and a set T C V. Dehne M s _t = Af s ,T(y) to be the matrix with rows and 
columns indexed by sets S of size at most s, defined by 

[M s , T }s,s' = VsuS'ut- (16) 

So M s ^t is a principal submatrix of M{y) 1 except that possibly some rows 
and columns are duplicated. For fixed k, we can restrict y to subsets of size 
at most k and replace M(y) y in (15) by conditions 

M s ,t cl for all T of size at most t, 

where 2s+t < k. This yields upper bounds on a(r) defined in terms of subsets 
of V of size at most k. The constraints (16) can be strengthened further to 

M s ,t = (M^ t -ut»)t',t»ct - ° for a11 T of size *■ ( 17 ) 

Here M s ,t is a 2' T I x 2' T ' matrix of smaller matrices, which can be seen as 
a submatrix of M{y) (again up to duplication of rows and columns). By the 
moment-structure of M s _t, condition (17) is equivalent to 

Y^ (-1)\ R \ R '\M S , R , y for all R of size at most t. 

R'CR 



Invariant scmidefinite programs 29 

Using SDP constraints of the form (16) and (17) with 2s + 1 < fc, we can 
obtain the (modified) theta number (fc = 2), Schrijver's bound using triples 
(fc = 3), or rather a strengthened version from Laurent [65], and the bound 
from Gijswijt, Mittelmann, Schrijvcr [41] (fc = 4). 

Several hierarchies of upper bounds have been proposed. The hierarchy 
introduced by Lasserre [64] (also see Laurent [65]) is obtained using (16) 
by fixing t = and letting s = 1,2,... run over the positive integers. The 
hierarchy described in Gijswijt, Mittelmann, Schrijver [41] is obtained using 
(16) by letting k = 1, 2, . . . run over the positive integers and restricting s and t 
by 2s+t < k. Finally, the hierarchy defined in Gvozdcnovic, Laurent, Vallcntin 
[47] fixes s — 1 and considers the constraints (17), where t = 0, 1, . . . runs over 
the nonnegative integers. There, also computational results are obtained for 
Paley graphs in the case t — 1,2. 

A slight variation is obtained by restricting y to sets of at least one element, 
deleting the row and column corresponding to the empty set from (submatrices 
of) M(y). The normalization y® = 1 is then replaced by X^ev^M = ^ 
and the objective is replaced by EtitiEV^{».»)' Since M 1 j y implies that 
(J2vevV{v}) 2 < V0^2u,vevy{u,v}, this gives a relaxation. This variation was 
used in Schrijver [86] and Gijswijt, Schrijver, Tanaka [42]. 

Symmetry reduction 

Application to the graph r = r q (n, d) with vertex set V = q", gives semidcf- 
inite programming bounds for A q (n,d). A priory, this gives an SDP that has 
exponential size in n. However, using the symmetries of the Hamming space, 
it can be reduced to polynomial size as follows. 

The action of Aut(q, n) on the Hamming space V, extends to an action 
on V{V) and hence on vectors and matrices indexed by V{V). Since the 
semidefinite program in (15) is invariant under Aut(<7, n), we may restrict 
y (and hence M{y)) to be invariant under Aut(g,n), without changing the 
optimum value. The same holds for any of the bounds obtained using principal 
submatrices. Thus M(y)s,r only depends on the orbit oi SUT. In particular, 
for any fixed integer fc, the matrices M s ^t with 2s+ \T\ < k are described using 
variables corresponding to the orbits of sets of size at most fc. Two matrices 
M s ^t and M s ^t' are equal (up to permutation of rows and columns) when T 
and T" are in the same orbit under Aut(g,n). Hence, the number of variables 
is bounded by the number of orbits of sets of size at most fc (0(n 2 _1 ) in 
the binary case) , and for each such orbit there are at most a constant number 
of distinct constraints of the form (16,17). This concludes the first step in the 
symmetry reduction. 

The matrices M Si t are invariant under the subgroup of Stab(T, Aut(g, n)) 
that fixes each of the elements of T. The dimension of the matrix *-algebra of 
invariant matrices is polynomial in n and hence the size of the matrices can 
be reduced to polynomial size using the regular ^-representation. 
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By introducing more duplicate rows and columns and disregarding the row 
and column indexed by the empty set, we may index the matrices M s x by 
ordered s-tuples and view M S _ T as an (q™) s x (q") s matrix. Let A s _t be the 
matrix *-algebra of (q") s x (q™) s matrices invariant under the the subgroup 
of Stab(T, Aut(g, n)). This matrix *-algebra can be block diagonalized using 
techniques from Gijswijt [40]. 

4.4 Generalizations 

The semidcfinitc programs discussed in the previous sections, are based on 
the distribution of pairs, triples and /c-tuples in a code. For each tuple (up to 
isometry) there is a variable that reflects the number of occurrences of that 
tuple in the code. Exclusion of pairs at distance l,...,d— 1, is modeled by 
setting variables for violating tuples to zero. 

Bounds for other types of codes in the Hamming space can be obtained by 
setting the variables for excluded tuples to zero. This does not affect the under- 
lying algebra of invariant matrices or the symmetry reduction. For triples in 
the binary Hamming space, this method was applied to orthogonality graphs 
in de Klerk, Pasechnik [58] and to pseudo-distances in Bachoc, Zcmor [10]. 
Lower bounds on (nonbinary) covering codes have been obtained in Gijswijt 
[39] by introducing additional linear constraints. 

Bounds on constant weight binary codes were given in Schrijvcr [86]. 

In the nonbinary Hamming space, the symbols 0, . . . , q — 1 arc all in- 
terchangeable. In the case of Lee-codes, the dihedral group D q acts on the 
alphabet, which leads to a smaller isometry group D q l S n . The Bose-Mesner 
algebra of the Lee-scheme is still commutative. The corresponding linear pro- 
gramming bound was studied in Astola [3]. To the best knowledge of the 
author, stronger bounds using triples have not been studied in this case. 



5 Crossing numbers 

We describe an application of the regular ^-representation to obtain a lower 
bound on the crossing number of complete bipartite graphs. This was de- 
scribed in dc Klerk, Pasechnik, Schrijver [59], and extends a method of de 
Klerk, et. al. [55]. 

The crossing number cr(_T) of a graph _T is the minimum number of in- 
tersections of edges when r is drawn in the plane such that all vertices 
are distinct. The complete bipartite graph K mn is the graph with vertices 
1, . . . , m, Mi, . . . ,u n and edges all pairs iuj for i e [m] and j e [n\. (This 
notation will be convenient for our purposes.) 

This relates to the problem raised by the paper of Zarankiewicz [101], 
asking if 

cr(K m>n ) = Z(m, n) = \h{m- 1) 2 J [\{n - 1) 2 J . (18) 
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In fact, Zarankicwicz claimed to have a proof, which however was shown to be 
incorrect. In (18), < follows from a direct construction. Equality was proved 
by Kleitman [53] if min{m,n} < 6 and by Woodall [100] if to € {7,8} and 
n£ {7,8,9, 10}. 

Consider any to, n. Let Z m be the set of cyclic permutations of [to] (that 
is, the permutations with precisely one orbit). For any drawing of K m . n in the 
plane and for any m, let j(ui) be the cyclic permutation (1, 12, ■ ■ ■ , i m ) such 
that the edges leaving m in clockwise order, go to l,ii, ■ ■ ■ ,i m respectively. 

For a,r £ Z m , let C G ^ be equal to the minimum number of crossings when 
we draw K m ^ in the plane such that 7(^1) = o and 7(1^2) = r. This gives a 
matrix C = {C a ^) in R z "» xZ "». Then the number a m is defined by: 

a m = min{(X,C) : X £ K^» xZ ™, X h 0, {X, J) = 1}, (19) 

where J is the all-one matrix in M. Zm x Zm . 

Then a m gives a lower bound on ci{K mtn ), as was shown by de Klerk, et. 

al. [55]. 

Theorem 5.1 cr(K mn ) > ^n 2 a m — ^n[j(m — 1) 2 J for all to, n. 

Proof. Consider a drawing of K m , n in the plane with cr(K m . n ) crossings. For 
each cyclic permutation a, let d a be the number of vertices Ui with j(ui) = a. 
Consider d as column vector in M. Zm , and define the matrix X in M. Zm x Zm by 

X = n - 2 dd T . 

Then X satisfies the constraints in (19), hence a m < (X,C). For i,j — 
1, . . . , n, let pij denote the number of crossings of the edges leaving Ui with 
the edges leaving Uj. So if i 7^ j, then /3jj > C 7 ( Ui ) j7 ( u .). Hence 



n 2 (X,C) = (dd T C)) = 


- d Cd = 2_^ Cfiu^^iuj) 


n n 


= 2cr(iiT mi „) +n[\(m- 


i+3 




Therefore, 




CT(K m , n ) > \n 2 {X,C) - \n[\{m- 


l) 2 j > \a m n 2 -\n[\{n 



iy\. 

The semidcfinite programming problem (19) defining a m has order (m — 
1)!. Using the regular ^representation, we can reduce the size. Fix m £ N. 
Let G = S m x { — 1, +1}, and define h : G —J- Sz m by 



h V} i(a) = 7rc l 7r 



i_-l 
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for 7r G S m , i G { — 1, +1}, & G ■Zm- So G acts on Z m . It is easy to sec that C 
and (trivially) J are G-invariant. 

Using the regular *-representation, a m up to m — 9 was computed. For 
m = 8, a 8 = 5.8599856444 . . ., implying 

crCftTg.n) > 2.9299n 2 - 6n. 

This implies for each fixed m > 8, with an averaging argument over all sub- 
graphs K s ,„: 

cr(K„, „) to 

lim }, m ' n > 0.8371- 



n^oo Z(m,n) 771—1 

Moreover, ag = 7.7352126 . . ., implying 

cr(i^ 9 ,„) > 3.8676063n 2 - 8n, 

and for each fixed m > 9: 

cr(i\T„, „) m 

lim „\ m '"/ > 0.8594- 



n^oo Z(m,n) 771—1 

Thus we have an asymptotic approximation to Zarankiewicz's problem. 

The orbits of the action of G onto Z m x Z m can be identified as follows. 
Each orbit contains an element (cyj, t) with oq = (1, . . . , m). So there are at 
most (to — 1)! orbits. Next, (<7q,t) and ((To,t') belong to the same orbit if 
and only if r' = t 9 for some g G G that fixes oo . (There are only few of such 
g.) In this way, the orbits can be identified by computer, for to not too large. 
The corresponding values C a . T for each orbit [a, r], and the multiplication 
parameters, also can be found using elementary combinatorial algorithms. 
The computer does this all within a few minutes, for to < 9. But the resulting 
semidefinite programming problem took, in 2006, seven days. It was the largest 
SDP problem solved by then. 



6 Spherical codes 

Finding upper bounds for codes on the sphere is our next application. Here 
one deals with infinite-dimensional semidefinite programs which are invariant 
under the orthogonal group which is continuous and compact. So we are in 
the situation for which the techniques of Section 3 work. 

We start with some definitions: The unit sphere S n ~ 1 of the Euclidean 
space 1" equipped with its standard inner product x ■ y — X)i=i x *J/*j ^ s 
defined as usual by: 

S"- 1 ={(»!,..., i„)el": x- x = 1}. 

It is a compact space, on which the orthogonal group O n (M) acts homo- 
geneously. The stabilizer Stab(xo,0„(M)) of one point xo G 5 rl_1 can be 
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identified with the orthogonal group of the orthogonal complement (Rxo) -1 
of the line Rxo thus with O n _ 1 (R), leading to an identification between the 
sphere and the quotient space O„(R)/0„_i(R). The unit sphere is endowed 
with the standard 0„(M)-invariant Lebesgue measure fj, with the normaliza- 
tion fi^S"" 1 ) = 1. The angular distance d$(x,y) of (x,y) £ (5"" 1 ) 2 is defined 
by 

d$(x,y) — arccos(x • y) 

and is O n (R)-invariant. Moreover, the metric space (S n ,d$) is two-point 
homogeneous (see Section 3.4) under O n (R). 

The minimal angular distance dg(C) of a subset C C S*™ -1 is by definition 
the minimal angular distance of pairs of distinct elements of C . It is a classical 
problem to determine the maximal size of C, subject to the condition that 
dg(C) is greater or equal to some given minimal value m - m . This problem is the 
fundamental question of the theory of error correcting codes (see e.g. Conway, 
Sloane [26], Ericson, Zinoviev [37]). In this context, the subsets C C S*™ -1 are 
referred to as spherical codes. In geometry, the case 9 m m = tt/3 corresponds 
to the famous kissing number problem which asks for the maximal number of 
spheres that can simultaneously touch a central sphere without overlapping, 
all spheres having the same radius (see e.g. Conway, Sloane [26]). 

We introduce notations in a slightly more general framework. We say that 
C C S"- 1 avoids n C (S™- 1 ) 2 if, for all (x,y) € C 2 , (x,y) £ Q. We define, 
for a measure A, 

A(S n -\ Q, A) = sup (A(C) : C C S^ 1 measurable, C avoids Q). 

We have in mind the following situations of interest: 

(i) fl = {(x,y) : dg{x,y) €]0,# m i n [} and A is the counting measure denoted 
fi c . Then, the I?- avoiding sets are exactly the spherical codes C such that 

d$(C) > #mi n . 

(ii) fl = {(x, y) : dg(x,y) = 6} for some value 9^0, and A = /x. Here we 
consider subsets avoiding only one value of distance, so these subsets can 
be infinite. This case has interesting connections with the famous problem 
of finding the chromatic number of Euclidean space (sec Soifer [87], Bachoc 
ct. al. [11], Oliveira, Vallentin [79], Oliveira [78]) 

(hi) Q = {(x,y) : dg(x,y) G]0,6» min [ or (x,y) £ Cap(e,0) 2 }, and A = tt c . Here 
Cap(e, <j)) denotes the spherical cap with center e and radius <j>: 

Cap(e,0) = {x E S™- 1 : d g (e,x) < 4>) 

and we are dealing with subsets of a spherical cap with given minimal 
angular distance. 

The computation of A(S n ~ 1 , f2, A) is a difficult and unsolved problem in 
general, so one aims at finding lower and upper bounds for this number. 
To that end, for finding upper bounds, we borrow ideas from combinatorial 
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optimization. Indeed, if one thinks of the pair (S n , Q) being a graph with 
vertex set 5" _1 and edge set f2, then a set C avoiding Q corresponds to an 
independent set, and A(S n , fi, A) to the independence number of this graph. 
In general, for a graph r with vertex set V and edge set E, an independent set 
is a subset S of V such that no pairs of elements of S are connected by an edge, 
and the independence number a(r) of T is the maximal number of elements of 
an independent set. The Lovdsz theta number d(r) was introduced by Lovasz 
[69]. It gives an upper bound of the independence number a{r) of a graph 
r, and it is the optimal solution of a semidefinite program. We review Lovasz 
theta number in Section 6.1. Then, in Section 6.2, we introduce generalizations 
of this notion, in the form of conic linear programs, that provide upper bounds 
for A{S n ~ x , J?, A). Using symmetry reduction, in the cases (i), (ii), (hi) above, 
it is possible to approximate these conic linear programs with semidefinite 
programs, that can be practically computed when the dimension n is not too 
large. This step is explained in Section 6.5. It involves the description of the 
cones of G- invariant positive definite functions, for the groups G = O n (R) 
and G — Stab(xo, O n (K)). We provide this description, following the lines of 
Section 3, in Sections 6.3 and 6.4. Finally, in Section 6.6 we indicate further 
applications. 

6.1 Lovasz i? 

This number can be defined in many equivalent ways (see Lovasz [69] , Knuth 
[62]). We present here the most appropriate one in view of our purpose, which 
is the generalization to S n ~ 1 . 

Definition 6.1 The theta number of the graph r — (V,E) with vertex set 
V = {1, 2, . . . , n} is defined by 

i9(r) = max{(X, J n ) :XhO, (X,I n ) = l,X {j = for all (i,j) e E) (20) 

where I n and J n denote respectively the identity and the all- one matrices of 
size n. 

The dual program gives another expression of $(-T) (there is no duality 
gap here because X — I n is a strictly feasible solution of (20) so the Slater 
condition is fulfilled): 

i?(.T) = min {t : X > 0, X u =t-l, X t] = -1 for all (i,j) <£ E}. (21) 

We have already mentioned that this number provides an upper bound for the 
independence number a(r) of the graph r. It also provides a lower bound 
for the chromatic number x(-0 of the complementary graph r (the chromatic 
number of a graph is the minimal number of colors needed to color its vertices 
so that two connected vertices receive different colors); this is the content of 
the celebrated Sandwich theorem of Lovasz [691 : 
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Theorem 6.2 

a(r) < d(r) < X (F) (22) 

By modifying the definition of the theta number we get the strengthening 
a(r) < tf'(r) < i?(r), where 



(23) 



&'(r) = max {(X, J n ) :XhO, A > 0, 

(X, I n ) = 1, Xij = for all (*, j) e E}. 

Again, this program has an equivalent dual form: 

#'(r)=mm{t:XhO,X ii <t-l, ( 

Xij < -1 for all (i,j) <£E). K > 

6.2 Generalizations of Lovasz # to the sphere 

In order to obtain the wanted generalization, it is natural to replace in (23) 
the cone of positive semidefinite matrices indexed by the vertex set V of the 
underlying graph, with the cone of continuous, positive definite functions on 
the sphere, defined in Section 3.1. In fact, there is a small difficulty here due to 
the fact that, in contrast with a finite dimensional Euclidean space, the space 
C{X) of continuous functions on an infinite compact space cannot be identi- 
fied with its topological dual. Indeed, the topological dualC(X)* of C(A) (i.e. 
the space of continuous linear forms on C(X)) is the space Ai(X) of complex 
valued Borel regular measures on X (see e.g. Rudin [83, Theorem 6.19], Con- 
way [25, Appendix C]). Consequently, the two forms of $'(r) given by (23) 
and (24) lead in the infinite case to two pairs of conic linear programs. As we 
shall see, the right choice between the two in order to obtain an appropriate 
bound for A(S n ~ 1 , f2, A) depends on the set ft. 



Definition 6.3 Let fi c = { (x, y) : (x, y) £ Q and i^y}. Let 

ViiS 71 - 1 , Q) = inf { t : F > 0, F(x, x)<t-l, 

F{x,y) < -1 for all (x,y) e £2 C }, 



(25) 



and 



<? 2 (S™ _1 , ft) = sup { (F, 1) : F y 0, F > 0, 

(F,l A ) = l, (26) 

F(x,y) = Oforall(x,y)en }. 

In the above programs, F belongs to C((5"~ 1 ) 2 ). The notation F > stands 
for "F takes nonnegative values" . The function taking the constant value 1 is 
denoted 1, so that (F, 1) equals the integral of F over (S*™ -1 ) 2 . In contrast, 
with (F, 1/j) we mean the integral of the one variable function F[x,x) over 
A = {(x, x) : x <G S*™ -1 }. Let us notice that, since F is continuous in both 
programs, the sets fl and fl c can be replaced by their topological closures fi 
and fl c . 



36 C. Bachoc, D.C. Gijswijt, A. Schrijvcr, F. Vallentin 

A positive semidefinite measure on (S 1 " --1 ) 2 is one that satisfies (A, /) > 
for all / >z 0, where 



(A,/)= / f(x)d\(x). 
Jx 

and this property is denoted A >z 0. In a similar way a nonnegative measure 

A is denoted A > 0. 

Theorem 6.4 With the above notations and definitions, we have: 

1. If fi c n A = 0, then the program dual to di reads 

tfJOS" 1 - 1 , fi) = sup { (A, 1) : A h 0, A > 0, 

X(A) = 1,_ 
supp(A) C fi c U A } 

and 

A(S n -\ 12, ju c ) < MS n ~\ O) = $l{S n -\ fi). (27) 

2. If fi n A = 0, f/ien i/ie program dual to "&2 reads 

ti* 2 (S n -\n)=m{{ t: XhO, _ 

\<t(i A -n 2 over{S n - r ) 2 \T2 }, 

and 

A(s n -\ n, n) < Ms n ~\ n) = ^(fi* 1-1 , «). (28) 

Proof The dual programs are computed in a standard way (see Duffin [35], 
Barvinok [15]). In order to prove that there is no duality gap, one can apply 
the criterion [15, Theorem 7.2]. 

For the inequality (27), let C be a maximal spherical code avoiding fi. 
Then, the measure A = 8 C 2/\C\, where 5 denotes the Dirac measure, is a 
feasible solution of d\ with optimal value |C| = A(S n ~ 1 , fi,fi c )- 

For the inequality (28), let (t, A) be a feasible solution of i^- Then, if C 
avoids fi, C 2 C (S*™- 1 ) 2 \7X Thus, < A(C 2 ) < t/*(C) - m(C) 2 , leading to 
the wanted inequality /j,(C) < t. D 

Example 6.5 In the situations (i) and (ii) above, the set fi fulfills the con- 
dition 1. of Theorem 6.4 while for (Hi) we are in the case 2. 

6.3 Positive definite functions invariant under the full orthogonal 
group 

It is a classical result of Schoenberg [84] that these functions, in the variables 
(x, y) G {S n ~ 1 ) 2 , arc exactly the nonnegative linear combinations of Gegen- 
bauer polynomials (Section 3.4) evaluated at the inner product x-y. We briefly 
review this classical result, following the lines of Section 3. 

The space Hom£ of polynomials in n variables X\, . . . ,x n which are ho- 
mogeneous of degree k affords an action of the group O n (W) acting linearly 
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on the n variables. The Laplace operator A = J^"_ 1 -4-^ commutes with this 
action, thus the subspace Harm)! defined by: 

Harm£ = {?£ HomJ! : AP = 0} 

is a representation of 0„(R) which turns out to be irreducible (see e.g. An- 
drews, Askey, Roy [1]). Going from polynomials to polynomial functions on 
the sphere, we introduce H£, the subspace of C(S n ~ 1 ) arising from elements 
of Harm)!. Then, for d > k, H'£ is a subspace of the space Vd = Pol<d(5 rl_1 ) 
of polynomial functions on S n ^ 1 of degree up to d. We have: 

Theorem 6.6 The irreducible O n (R)- decomposition o/Pol<d(>S'"~ 1 ) is given 
by: 

Pol^S"- 1 ) = # " J_ F™ J_ • • • J_ H d \ (29) 

fc-l\ /n+fc-3\ 



where, for all k > 0, HJ! ~ Harm)! is of dimension ft,)! = ( fe ) — ( 
The function Yk(dg(x,y)) associated to H£ following (14) equals: 



fc-2 



Y k (de(x,y)) = h n k Pi (n - 3)/2 ^- 3)/2 \ x ■ y). (30) 

Proof. For the sake of completeness we sketch a proof. Because S*™ -1 is two- 
point homogeneous under 0„(R), the algebra (V^)^ related to the finite 
dimensional O n (R)-space Vd is equal to the set of polynomial functions in the 
variable t = x ■ y of degree up to d, thus has dimension d + 1. On the other 
hand, it can be shown that the d+ 1 subspaces H£, for < k < d, are non zero 
and pairwise distinct. Then, the statement (29) follows from the equality of 
the dimensions of (Vd)^ and of End "' (V^) which are isomorphic algebras 
(12). It is worth to notice that we end up with a proof that the spaces H£ are 
irreducible, without referring to the irreducibility of the spaces Harm)! (which 
can be proved in a similar way). The formula for hi}, follows by induction. 

The functions Yk(dg/ X y \) associated to the decomposition (29) must be 
polynomials in the variable x ■ y of degree k; let us denote them temporary 
by Qk(t)- A change of variables shows that, for a function f(x) € C(S n ~ 1 ) of 
the form f(x) = (p(x ■ y) for some y e S n , 

f(x)dfi(x) =c n f tp(t)(l - t 2 Y n -V/ 2 dt 

s™- 1 J-l 

for some constant c n . Then, because the subspaces HJ} are pairwise orthogo- 
nal, the polynomials Qk must satisfy the orthogonality conditions 



l 



Q k (t)Qi(t)(l-t 2 Y n -V/ 2 dt = 

for all k =^ I thus they must be equal to the Gegenbauer polynomials up to a 
multiplicative factor. Integrating over S*™ -1 the formula (9) when x — y shows 
that Yfe(0) = hk thus computes this factor. □ 
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Corollary 6.7 Let FQ(t) = p«»-3)/2,(n-3)/2)^_ Then> F G C(S™-i)g;W if 
and only if 

F(x, y) = Yl f kP k( x ■ w)> w ^ A ^ ° f° r aU k ^°- ( 31 ) 

fc>0 

6.4 Positive definite functions invariant under the stabilizer of one 
point 

We fix an element e e S" 1 " 1 and let G = Stab(e, O n (R)). Then, the orbit O(x) 
of x e S*™ -1 under G is the set 

0{x) ={yeS n - 1 :e-y = e-x}. 

The orbit 0((x,y)) of (x,y) e S 1 " -1 equals 

0((x, y)) = {(z, i) e (S"" 1 ) 2 : (e • z, e • £, z • t) = (e • x, e • y, x • y)}. 

In other words, the orbits of G on (S*™ -1 ) 2 are parametrized by the triple of 
real numbers (e ■ x,e ■ y,x ■ y). So the G-invariant positive definite functions 
on S 1 ™ -1 are functions of the three variables u = e-x,v — e-y,t — x-y. Their 
expression is computed in Bachoc, Vallentin [6]. 

Theorem 6.8 The irreducible decomposition ofVd = Pol<d{S n ) under the 
action of G = Stab(e, S n ~ 1 ) ~ 0„_i(R) is given by: 

d 

PoUdiS"- 1 ) ~ 0(Harm^- 1 ) d - fe+1 . (32) 

fc=0 

The coefficients of the matrix Yk = Y k n of size d—k + 1 associated to Harm)? - 
are equal to: 

Y^ tij {u,v,t) = u i v i Q^- 1 (^v,t) (33) 

where < i, j < d — k, and 

grw> - (d - a )d - ^))"vr-( 7 =i==_) 

Proof We refer to Bachoc, Vallentin [6] for the details. In order to obtain 
the G-decomposition of the space Pol< c ;(S'™ _1 ) we can start from the O n (R)- 
decomposition (29). The 0„(R)-irreducible subspaces H% split into smaller 
subspaces following the 0„_i(R)-isomorphisms (see e.g. Vilenkin, Klimyk 

[98]): 

k 

Harm£ ~ Harm^ 1 (34) 

4=0 

leading to (32). The expression (33) follows from the definition (9) of Ek,ij(x, y) 
and from the construction of convenient basis (ek,i, s )s- These basis arise from 
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explicit embeddings of H^~ x into V4, defined as follows. For x € 5™ _1 , let 
x = ue + y/1 - u 2 C where ( e (Re) 1 - and ( ■ £ = 1. Let p^j send / e H^ 1 to 
<Pk,i(f) where 

¥>*,*(/)(*) = «*(1 - « 2 ) fc/2 /(0- 

Then, {ek,i, s '■ 1 < s < fe)!~ } is taken to be the image by <pk,i of an orthonor- 
mal basis of H^ 1 . D 

6.5 Reduction using symmetries 

If the set Q is invariant under a subgroup G of 0*„(R), then the feasible 
sets of the conic linear programs (25) and (26) can be restricted to the G- 
invariant positive definite functions F . Indeed, symmetry reduction of finite 
dimensional scmidefinite programs extends to infinite compact spaces, with 
the now familiar trick that replaces the finite average over the elements of a 
finite group by the integral for the Haar measure of the group. We apply this 
general principle to the programs (25) and (26) and we focus on the cases 
(i)-(iii) above. 

Case (i): fl — {(x,y) : d g (x,y) £]0, min [}. 

The set fl is invariant under O n (W). According to Theorem 6.4 we consider 
the program (25) where F >z can be replaced by (31). The program $1 
becomes after a few simplifications: 



^ 1 (5"- 1 ,^)=inf{ I + ^/ fc :/ fe >0, 



k - 1 (35) 

i + ^/ fe P fe "(*)<o te[-M]}, 

fc>i 

where s = cos(# m i n ). One recognises in (35) the so-called Delsarte linear pro- 
gramming bound for the maximal number of elements of a spherical code with 
minimal angular distance # m in, see Delsarte, Goethals, Seidel [33], Kabatian- 
sky, Levenshtein [51], Conway, Sloane [26, Chapter 9]. The optimal value of 
the linear program (35) is not known in general, although explicit feasible 
solutions leading to very good bounds and also to asymptotic bounds have 
been constructed (Kabatiansky, Levenshtein [51], Levenshtein [67], Odlyzko, 
Sloane [77]). Moreover, this infinite dimensional linear program can be effi- 
ciently approximated by semidefinite programs defined in the following way: 
in order to deal with only a finite number of variables /&, one restricts to 
k < d (it amounts to restrict to G-invariant positive definite functions of 

V d ). Then the polynomial 1 + X^fe=i fk^k 1S required in (35) to be non- 
positive over a certain interval of real numbers. By the theorem of Lukacs 
concerning non-negative polynomials (see e.g. Szego [91, Chapter 1.21], this 
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can be expressed as a sums of square condition, hence as a semidefinite pro- 
gram. Then, when d varies, one obtains a sequence of semidefinite programs 
approaching ?9 1 (5 n ~ 1 , fl) from above. 

Case (ii): fl — {(x,y) : d e (x,y) — 6}. 

The set fl is again invariant under O n (M) but now we deal with (26), which 
becomes: 

MS n -\ f2) = sup { /„ : h > o, J2 h = i, 



^ , (36) 

£/ fe p fe "( s ) = o} 

k>0 

where s — cos(9). This linear program has infinitely many variables but only 
two constraints. Its optimal value turns to be easy to determine (we refer to 
Bachoc et. al. [11] for a proof). 

Theorem 6.9 Let m(s) be the minimum of P£(s) for k = 0, 1, 2, . . . . Then 

m(s) 



Ms n -\n) = 



m(s) — 1 



Case (iii): fl — {(x,y) : d g (x,y) 6]0,6> min [ or (x,y) £ Cap(e, <j>) 2 }. 

This set is invariant by the smaller group G = Stab(e,0„(R)). Like in case 
(i), the program §\ must be considered and in this program F can be assumed 
to be G- invariant. 

From Stone- Weierstrass theorem (see e.g. Conway [25, Theorem 8.1]), the 
elements of C{{S n ~ 1 ) 2 ) can be uniformly approximated by those of Vj . In 
addition, one can prove that the elements of C((S n ~ 1 ) 2 )yo can be uniformly 

~ (2) 

approximated by positive definite functions belonging to V^ . We introduce: 

#{ d \S n -\S?)=m{{t:F€(VP) h0 , 

F(x,x)<t-1, (37) 

F(x,y) < -1 for all (x,y) £ Q c }. 

So, #i(S' Tl - 1 ,r2) < d[ d) (S n -\n), and the limiting value of i^S™" 1 , Q) 
when d goes to infinity equals d\(S n ~ l , Q). Since fl and Vd are invariant by 
G, one can moreover assume that F is G-invariant. From Theorems 3.3 and 
6.8, we have an expression for F: 



F(x,y) = ^2(F k ,Y k (u,v,t)), F k h0, 

fe=0 
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where the matrices F k are of size d — k + 1. Replacing in d[ ' leads to: 

${ d) (S™- 1 , fl) =ini{t:F k hO, 

d 

Y,{F k ,Y k {u,u,l))<t ue[s',l], 

fe=0 
d 



J2(Fk,Y k (u,v,t)))<-l s'<u<v<l 

-1 < t< s} 



k=0 



with s — cos(9) and s' = cos((f>). The left hand sides of the inequalities are 
polynomials in three variables. Again, these constraints can be relaxed using 
sums of squares in order to boil down to true semidefinite programs. We refer 
to Bachoc, Vallentin [7] for the details, and for numerical computations of 
upper bounds for codes in spherical caps with given minimal angular distance. 

6.6 Further applications 

In Bachoc, Vallentin [6] it is shown how Delsarte linear programming bound 
[29] can be improved with semidefinite constraints arising from the matrices 
Y k (33). The idea is very much the same as for the Hamming space given in 
Schrijver [86] and explained in Section 4 : instead of considering constraints 
on pairs of points only, one exploits constraints on triples of points. More pre- 
cisely, if S k (u,v,t) denotes the symmetrization of Y k (u,v,t) in the variables 
(u, v, t), then the following semidefinite property holds for all spherical code 
C: 

J2 SUx-y,yz,z-x)hO- (38) 

{x,y,z)eC 3 

From (38), it is possible to define a semidefinite program whose optimal value 
upper bounds the number of elements of a code with given minimal angu- 
lar distance. In Bachoc, Vallentin [6], Mittclmann, Vallentin [73], new upper 
bounds for the kissing number have been obtained for the dimensions n < 24 
with this method. We give next a simplified version of the semidefinite pro- 
gram used in [6]. Another useful version is given in Bachoc, Vallentin [9] for 
proving that the maximal angular distance of 10 points on S 3 is cos(l/6). 

Theorem 6.10 The optimal value of the semidefinite program: 

inf{ l + (F Q ,J d+1 ):F k hO 
d 

Y,(F k ,SZ{u lU ,l))<- T -\<u<s 



fc=0 
d 



3 (39) 



^<F fe ,5£(w,M)><0, -l<u,v,t<s} 



fc=0 
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is an upper bound for the number A(S n , Q, /i c ) where Q = {(x,y) G 
(S*™ -1 ) 2 : s < x-y < 1}, i.e. for the maximal number of elements of a spherical 
code with minimal angular distance at least equal to min ~ arccos(s). 

Proof. Let (Fq, . . . , Ft) a feasible solution of (39). Let 

d 

F(x,y,z) = ^2(F k ,S%(x-y,y- z,z-x)). 

k=0 

If C is a spherical code, we consider E = J2( x v z )ec 3 F{x, V-> z )- We nave: 
0<i7=^F(x,x,x)+ J2 F(x,x,y)+ ^ F(x,y,z) 

xeC \{x,y,z}\=2 \{x,y,z}\=3 

where the inequality holds because of (38). Then, taking S£ (1, 1, 1) = Jd+i 
and ££(1,1,1) = for k > 1 into account, we have F(x,x,x) — (i*o, Jd+i)- 
If moreover do(C) > m i n , we can apply the constraint inequalities of the 
program to the second and third terms of the right hand side. We obtain: 

o<i:<(^o,J d+ i)|q-|q(|q-i) 

leading to the inequality \C\ < 1 + (Fq, Jd+i)- □ 



7 Sums of squares 

A fundamental task in polynomial optimization and in real algebraic geom- 
etry is to decide and certify whether a polynomial with real coefficients in n 
indctcrminatcs can be written as a sum of squares: Given p € R[xi, . . . , x n ] 
do there exist polynomials qi, . . . , q m <G R[x 1: . . . , x n ] so that 

2 i 2 i i 2 O 

P = <?i + 9 2 H 1" 1 m '■ 

This problem can be reformulated as a semidefinite feasibility problem: Let 
z be a vector containing a basis of M[xi, . . . , x n ]<d the space of polynomials 
of degree at most d. For example, let z be the vector containing the monomial 
basis 

Z = (1, Xl, X2, ■ • ■ , X n , Xi, X1X2, X 2 , • • • , x n ) 

which has length ("J ). A polynomial p £ R[xi,...,x n ] of degree 2d is a 
sum of square if and only if there is a positive semidefinite matrix X of size 
( n ~d d ) x ( n ~d~ d ) so tnat tne ( n td d ) nnear — linear in the entries of X — equations 

p(xi,. . . , x n ) = z T Xz 

hold. 

This semidefinite feasibility problem can be simplified if the polynomial p 
has symmetries. The method has been worked out by Gatermann and Parrilo 
in [38]. In this section we give the main ideas of the method. For details and 
further we refer to the original article. 
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7.1 Basics from invariant theory 

We start by explaining what we mean that a polynomial has symmetries. 
Again to simplify the presentation of the theory we consider the complex case 
only. 

Let G be a finite group acting on C™. This group action induces a group 
action on the polynomial ring C[xi, . . . , x n ] by 

{gp)(xi,. ..,£„)= p(g~ l (x!, . . .,»„)), 

and we say that a polynomial p is G-invariant if gp — p for all g G G The set 
C[xi, . . . , x n ] G of all G-invariant polynomials is a ring, the invariant ring. By 
Hilbert's finitcness theorem it is generated by finitely many G-invariant poly- 
nomials. Even more is true: Since the invariant ring has the Cohen-Macaulay 
property it admits a Hironaka decomposition: There are G-invariant polyno- 
mials rji, 9j G C[xi, . . . , x n ] so that 

r 

C[xi, . . . , x n f = jfcCpi, ...,$,}, (40) 

8=1 

hence, every invariant polynomial can be uniquely written as a polynomial in 
the polynomials r\i and 6j where rji only occurs linearly. We refer to Sturmfcls 
[90, Chapter 2.3] for the definitions and proofs; we only need the existence of 
a Hironaka decomposition here. 



7.2 Sums of squares with symmetries 

We consider the action of the finite group G restricted to the (™J ) -dimensional 
vector space of complex polynomials of degree at most d. This defines a uni- 
tary representation 

7r : G ->• Gl(C[xi, . . . , x n ]<d). 

From now on, by using the monomial basis, we see ir(g) as a regular matrix 



in 



c m><m. 



Example 7.1 For instance, the matrix g =(34) acts on C 2 and so on the 
polynomial p = 1 + x\ + X2 + x\ + X1X2 + x\ e C[xi,X2]<2 by 

(gp){xi,x 2 ) =p{xi +2x 2 ,3xi +4x 2 ) 

= 1 + (xi + 2x 2 ) + (3xi + 4x 2 ) + (xi + 2x 2 ) 2 

+ (xi + 2x 2 )(3xi + 4x 2 ) + (3xi + 4x 2 ) 2 
= 1 + (xi + 2x 2 ) + (3xi + 4x 2 ) + (x 2 + 4xix 2 + 4x^) 

+ (3x 2 + 10xix 2 + 8x 2 ) + (9x 2 + 24xi.x 2 + 16x 2 ). 

and so defines the matrix 
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Ad) 



/1000 0\ 

130 

0240 

00013 9 

4 10 24 
\0 004 8 16/ 

Let p € R[xi, . . . , x n ] be a polynomial which is a sum of squares and which 

is G-invariant. Thus we have a positive semidefinite matrix leRi d > x \ d ) 
so that 

p(x\, . . . , x n ) — z Xz = z*Xz, 

and for every g £ G we have 

gp(xi,...,X n ) = (ir(g)*z)*X(ir(g)*z) = z*ir(g)Xir(g)* z. 

Hence, X is G-invariant and lies in I Ca d / X V * ) j t the commutant algebra 

of the matrix *-algebra spanned by the matrices Tr(g) with g G G. So by 
Theorem 2.7 there are numbers D, mi, . . . , mjj and a *-isomorphism 

D 



9 



: ( C ("*' 



)*("**) 



0< 

fe=l 



Hence, cf. Step 2 (second version) in Section 1.2, we can write the polynomial 
p in the form 

(D m k \ 

y^ y^ x kiUV (p^ i (E kiUV ) j z 
k=\u,v=\ / 

with Z) positive semidefinite matrices 



-A-fc — ^fc,uv/i< u t 



k = 1, 



,£>. 



We define D matrices E 1: . ..,E D E (C[xi, . . . , x„] G )" lfc x " lfc with G-invariant 
polynomial entries by 



(-Efc)m, = (<p 1 ( E k,uv))^ x °-i,Yi z xf* II 



a,+ft 



d / v d / as matrices whose rows and 
columns are indexed by monomials FT . a;™* . Then, the polynomial p has a 
representation of the form 



D 



P 



(xi,...,x n ) = y^(x fc , ^ fc ). 



fc=i 



Since the entries of E/~ are G-invariant polynomials we can use a Hironaka 
decomposition to represent them in terms of the invariants r\i and 6j. We 
summarize our discussion in the following theorem. 
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Theorem 7.2 Let p be a G-invariant polynomial of degree 2d which is a sum 
of squares. Then there are numbers D, mi, . . . , mjj so that p has a represen- 
tation of the form 

D 

p(xi,...,x„) = ^2(X k ,E k ), 

k=l 

where X k G C m fc xm *; are positive semidefinite Hermitian matrices, and where 

( r \ m k xm k 

are matrices whose entries are polynomials (determined by a Hironaka decom- 
position of C[xi, . . . ,x n ] G and by the ^-isomorphism ip). 



8 More applications 

In the last years many results were obtained for semidefinite programs which 
are symmetric. This was done for a variety of problems and applications. In 
this final section we want to give a brief, and definitely not complete, guide 
to the extensive and growing literature. 

8.1 Interior point algorithms 

Kanno, Ohsaki, Murota, Katoh [52] consider structural properties of search 
directions in primal-dual interior-point methods for solving invariant semidef- 
inite programs and apply this to truss optimization problems, de Klerk, 
Pasechnik [57] show how the dual scaling method can be implemented to 
exploit the particular data structure where the data matrices come from a 
low-dimensional matrix algebra. 

8.2 Combinatorial optimization 

Using symmetry in semidefinite programs has been used in combinatorial op- 
timization for a variety of problems: quadratic assignment problem (dc Klerk, 
Sotirov [61]), travelling salesman problem (de Klerk, Pasechnik, Sotirov [60]), 
graph coloring (Gvozdenovic, Laurent [45], [46], Gvozdenovic [44]), Lovasz 
theta number (de Klerk, Newman, Pasechnik, Sotirov [56]). 

8.3 Polynomial optimization 

Jansson, Lasserre, Riener, Theobald [50] work out how constrained polyno- 
mial optimization problems behave which are invariant under the action of 
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the symmetric group or the cyclic group. Among many other things, Cim- 
pric, Kuhlmann, Scheiderer [24] extend the discussion of Gatermann, Parrilo 
[38] from finite groups to compact groups. Cimpric [23] transfers the method 
to compute minima of the spectra of differential operators. In [18] Bossc 
constructs symmetric polynomials which are non-negative but not sums of 
squares. 

8.4 Low distortion geometric embedding problems 

Linial, Magen, Naor [68] give lower bounds for low distortion embedding of 
graphs into Euclidean space depending on the girth. Vallentin [95] finds ex- 
plicit optimal low distortion embeddings for several families of distance regular 
graphs. Both papers construct feasible solutions of semidcfinitc programs by 
symmetry reduction and by using the theory of orthogonal polynomials. 

8.5 Miscellaneous 

Bai, dc Klerk, Pasechnik, Sotirov [12] exploit symmetry in truss topology 
optimization and Boyd, Diaconis, Parrilo, Xiao [19] in the analysis of fast 
mixing Markov chains on graphs. 

8.6 Software 

Pasechnik, Kini [80] develop a software package for the computer algebra 
system GAP for computing with the regular ^-representation for matrix *- 
algebras coming from coherent configurations. 

8.7 Surveys and lecture notes 

Several surveys and lecture notes on symmetry in semidefinite programs with 
different aims were written in the last years. The lecture notes of Bachoc [5] 
especially discuss applications in coding theory and extend those of Vallentin 
[97] which focuses on aspects from harmonic analysis. The survey [54] of de 
Klerk discusses next to symmetry also the exploitation of other structural 
properties of semidefinite programs like low rank or sparsity. 
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